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INHERITED AND SOMATIC MUTATIONS OF 
APC GENE IN COLORECTAL CANCER OF HUMANS 

The U.S. Government has a paid-up Ucense in this Invention and 
the right in limited circumstances to require the patent owner to 
license others on reasonable terms as provided for by the terms of 
grants awarded by the National Institutes of Health. 
TECHNI CAL AREA OF THE IHVENTTOM 

The invention relates to the area of cancer diagnostics and ther- 
apeutics. More particularly, the invention relates to detection of the 
germUne and somatic alterations of wild-type APC genes. In addition, 
it relates to therapeutic intervention to restore the function of APC 
gene product. 

BACKGRODND OF THE INVENTION 

According to the model of Knudson for tumorigenesls (Cancer 
Research, Vol. 45, p. 1482, 1985), there are tumor suppressor genes in 
all normal cells which, when they become non-functional due to muta- 
tion, cause neoplastic development. Evidence for this model has been 
found in the cases of retinoblastoma and colorectal tumors. The impli- 
cated suppressor genes in those tumors, RB, p53, DCC and MCC, were 
found to be deleted or altered in many cases of the tumors studied. 
(Hansen and Cavenee, Cancer Research. VoL. 47, pp. 5518-5527 (1987); 
Baker et al.. Science, Vol.. 244, p. 217 (1989); Fearon et al.. Science. 
Vol. 247, p. 49 (1990); Klnzler et al. Science Vol. 251. p. 1366 (1991).) 

In order to fully understand the pathogenesis of tumors, it will 
be necessary to identify the other suppressor genes that play a role in 
the tumorigenesls process. Prominent among these Is the one(s) pre- 
sumptively located at 5q2l. Cytogenetic (Herrera et al.. Am J. Med. 
Genet,, Vol. 25, p. 473 (1986) and linkage (Leppert et al.. Science. Vol. 
238. p. 1411 (1987): Bodmer et al.. Nature, VoL 328, p. 614 (1987)) stud- 
ies have shown that this chromosome region harbors the gene 
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responsible for familial adenomatous polyposis (FAP) and Gardner's 
Syndrome (GS). FAP is an autosomal-dominant, inherited disease in 
which affected individuals develop hundreds to thousands of 
adenomatous polyps, some of which progress to malignancy, GS is a 
variant of FAP in which desmoid tumors, osteomas and other soft tissue 
tumors occur together with multiple adenomas of the colon and rec- 
tum* A less severe form of polyposis has been Identified in which only 
a few (2-40) polyps develop. This condition also is familial and is linked 
to the same chromosomal markers as FAP and GS (Leppert et al.. New 
England Journal of Medicine, Vol, 322, pp. 904-908, 1990.) Additionally, 
this chromosomal region is often deleted from the adenomas 
(Vogelstein et al„ N. Engl. J. Med,, VoL 319, p. 525 (1988)) and carcino- 
mas (Vogelstein et al., N. Engl. J. Med., VoL 319, p. 525 (1988); Solomon 
et al.. Nature, Vol. 328, p. 616 (1987); Sasaki et al.. Cancer Research, 
Vol. 49, p. 4402 (1989); Delattre et al.. Lancet, Vol. 2. p. 353 (1989); and 
Ashton-Rickardt et al.. Oncogene, VoL 4, p. 1169 (1989)) of patients 
without FAP (sporadic tumors). Thus, a putative suppressor gene on 
chromosome 5q21 appears to play a role in the early stages of 
colorectal neoplasia in txjth sporadic and familial tumors. 

Although the MCC gene has been identified on 5q2i as a candi- 
date suppressor gene, it does not appear to be altered in FAP or GS 
patients. Thus there Is a need in the art for investigations of this chro- 
mosomal region to identify genes and to determine If any of such genes 
are associated with FAP and/or GS and the process of tumorigenesis. 
SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a method for 
diagnosing and prognoslng a neoplastic tissue of a human. 

It is another object of the invention to provide a method of 
detecting genetic predisposition to cancer. 

It is another object of the invention to provide a method of sup- 
plying wild- type A PC gene function to a cell which has lost said gene 
function. 

It is yet another object of the invention to provide a kit for 
determination of the nucleotide sequence of APC alleles by the 
polymerase chain reaction. 
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It is still another object of the Invention to provide nucleic acid 
probes for detection of mutations in the human APC gene. 

It is still another object of the invention to provide a cDNA mol- 
ecule encoding the APC gene product. 

It is yet another object of the invention to provide a preparation 
of the human APC protein. 

It is another object of the Invention to provide a method of 
screening for genetic predisposition to cancer. 

It is an object of the invention to provide methods of testing 
therapeutic agents for the ability to suppress neoplasia. 

It is still another object of the Invention to provide animals car- 
rying mutant APC alleles. 

These and other objects of the Invention are provided by one or 
more of the emlxxliments which are descrit)ed below. In one embodi- 
ment of the present Invention a method of diagnosing or prognosing a 
neoplastic tissue of a human is provided comprising: detecting somatic 
alteration of wild-type APC genes or their expression products in a 
sporadic colorectal cancer tissue, said alteration indicating neoplasia of 
the tissue. 

In yet another embodiment a method Is provided of detecting 
genetic predisposition to cancer in a human including familial 
adenomatous polyposis (FAP) and Gardner's Syndrome (GS), comprising: 
isolating a human sample selected from the group consisting of blood 
and fetal tissue; detecting alteration of wild-type APC gene coding 
sequences or their expression products from the sample, said alteration 
indicating genetic predisposition to cancer. 

In another embodiment of the present invention a method is 
provided for supplying wild- type APC gene function to a cell which has 
lost said gene function by virtue of a mutation in the APC gene, com- 
prising: introducing a wild-type APC gene into a cell which has lost 
said gene function such that said wild-type gene is expressed in the 
cell. 

In another embodiment a method of supplying wild-type APC 
gene function to a cell is provided comprising: introducing a portion of 
a wild- type APC gene into a cell which has lost said gene function such 
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that said portion is expressed In the cell, said portion encoding a part 
of the ARC protein which is required for non-neoplastic growth of said 
cell. A PC protein can also be applied to cells or administered to ani- 
mals to remediate for mutant A PC genes. Synthetic peptides or drugs 
can also be used to mimic APC function in cells which have altered 
A PC expression. 

In yet another embodiment a pair of single stranded primers is 
provided for determination of the nucleotide sequence of the APC gene 
by polymerase chain reaction. The sequence of said pair of single 
stranded DMA primers is derived from chromosome 5q band 21, said 
pair of primers allowing synthesis of APC gene coding sequences. 

In still another embodiment of the invention a nucleic acid probe 
Is provided which is complementary to human wild-type APC gene cod- 
ing sequences and which can form mismatches with mutant APC genes, 
thereby allowing their detection by enzymatic or chemical cleavage or 
by shifts in electrophoretic mobility. 

In another embodiment of the Invention a method is provided for 
detecting the presence of a neoplastic tissue in a human. The method 
comprises isolating a body sample from a human; detecting in said sam- 
ple alteration of a wild- type APC gene sequence or wild-type APC 
expression product, said alteration Indicating the presence of a 
neoplastic tissue in the human. 

In still another embodiment a cDNA molecule is provided which 
comprises the coding sequence of the APC gene. 

In even another embodiment a preparation of the human APC 
protein Is provided which is substantially free of other human proteins. 
The amino acid sequence of the protein Is shown in Figure 3 or 7. 

In yet another embodiment of the invention a method is provided 
for screening for genetic predisposition to cancer, Including familial 
adenomatous polyposis (FAP) and Gardner's Syndrome (GS), in a human. 
The method comprises: detecting among kindred persons the presence 
of a DNA polymorphism which is linked to a mutant APC allele in an 
individual having a genetic predisposition to cancer, said kindred being 
genetically related to the individual, the presence of said polymorphism 
suggesting a predisposition co cancer- 
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In another embodiment of the invention a method of testing 
therapeutic agents for the ability to suppress a neoplastically trans- 
formed phenotype is provided. The method comprises: applying a test 
sutjstance to a cultured epithelial cell which carries a mutation in an 
APC allele; and determining whether said test substance suppresses 
the neoplastically transformed phenotype of the cell. 

In another embodiment of the invention a method of testing 
therapeutic agents for the ability to suppress a neoplastically trans- 
formed phenotype is provided. The method comprises: administering a 
test substance to an animal which carries a mutant APC allele; and 
determining whether said test substance prevents or suppresses the 
growth of tumors. 

In still other embodiments of the invention transgenic animals 
are provided. The animals carry a mutant APC allele from a second 
animal species or have been genetically engineered to contain an inser- 
tion mutation which disrupts an APC allele. 

The present invention provides the art with the information that 
the APC gene, a heretofore unknown gene is, in fact, a target of muta- 
tional alterations on chromosome 5q21 and that these alterations are 
associated with the process of tumorigenesls. This information allows 
highly specific assays to l^e performed to assess the neoplastic status of 
a particular tissue or the predisposition to cancer of an individual. This 
Invention has applicability to Familial Adenomatous Polyposis, sporadic 
colorectal cancers, Gardners Syndrome, as well as the less severe 
familial polyposis discusses above. 
BRIEF DESCRIPTION OF THE DRAWINGS 

Figure lA shows an overview of yeast artificial chromosome 
(YAC) contigs. Genetic distances t)etween selected RFLP markers 
from within the contigs are shown in centiMorgans. 

Figure IB shows a detailed map of the three central contigs. 
The position of the six identified genes from within the FAP region is 
shown; the 5' and 3' ends of the transcripts from these genes have in 
general not yet been isolated, as indicated by the string of dots sur- 
rounding the bars denoting the genes' positions. Selected restriction 
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endonuclease reco^ition sites are indicated, B, BssH2; S, Sstll; 
M, Mlul; N, Nrul. 

Figure 2 shows the sequence of TBI and TB2 genes. The cDNA 
sequence of the TBI gene was determined from the analysis of 11 
cDNA clones derived from normal colon and liver, as described in the 
text. A total of 2314 bp were contained within the overlapping cDNA 
clones, defining an ORF of 424 amino acids t)eginning at nucleotide l. 
Only the predicted amino acids from the ORF are shown. The 
cartx)xy-terminal end of the ORF has apparently been Identified, but 
the 5' end of the TBI transcript has not yet been precisely determined. 

The cDNA sequence of the TB2 gene was determined from the 
YS-39 clone derived as described in the text. This clone consisted of 
2300 bp and defined an ORF of 185 amino acids beginning at nucleotide 
1. Only the predicted amino acids are shown. The carboxy terminal 
end of the ORF has apparently been identified, but the 5' end of the 
TB2 transcript has not been precisely determined. 

Figure 3 shows the sequence of the APC gene product. The 
cDNA sequence was determined through the analysis of 87 cDNA clones 
derived from normal colon, liver, and brain. A total of 8973 bp were 
contained within overlapping cDNA clones, defining an ORF of 2842 
amino acids. In frame stop codons surrounded this ORF, as descrit^ed in 
the text, suggesting that the entire APC gene product was represented 
In the ORF illustrated* Only the predicted amino acids are shown. 

Figure 4 shows the local similarity between human APC and ral2 
of yeast. Local similarity among the APC and MCC genes and the m3 
muscarinic acetylcholine receptor is shown. The region of the mAChR 
shown corresponds to that responsible for coupling the receptor to G 
proteins. The connecting lines indicate identities; dots indicate related 
amino acids residues. 

Figure 5 shows the genomic map of the 1200 kb NotI fragment at 
the FAP locus. The NotI fragment is shown as a bold line. Relevant 
parts of the deletion chromosomes from patients 3214 and 3824 are 
shown as stippled lines. Probes used to characterize the NotI fragment 
and the deletions, and three YACs from which subclones were obtained, 
are shown below the restriction map. The chimeric end of YAC 
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183H12 is indicated by a dotted line. The orientation and approximate 
position of MCC are indicated atxjve the map. 

Figure 6 shows the DNA sequence and predicted amino acid 
sequence of dpi (tB2). The nucleotide numbering begins at the most 5' 
nucleotide isolated. A proposed initiation methionine (base 77) is indi- 
cated in bold type. The entire coding sequence is presented. 

Figure 7 shows the cDNA and predicted amino acid sequence of 
DP2.5 (APC). The nucleotide numbering begins at the proposed initia- 
tion methionine. The nucleotides and amino acids of the alternatively 
spliced exon (exon 9; nucleotide positions 934-1236) are presented in 
lower case letters. At the 3' end. a poly(A) addition signal occurs at 
9530. and one cDNA clone has a poly(A) at 9563. Other cDNA clones 
extend beyond 9563. however, and their consensus sequence is included 
here. 

Figure 8 shows the arrangement of exons in DP2.5 (APC). 
(A) Exon 9 corresponds to nucleotides 933-1312; exon 9a corresponds to 
nucleotides 1236-1312. The stop codon in the cDNA is at nucleotide 
8535. (B) Partial intronic sequence surrounding each exon is shown. 
DETAn P p DESCRrPTTnitf 

It is a discovery of the present invention that mutational events 
associated with tumorigenesis occur in a previously unknown gene on 
chromosome 5q named here the APC (Adenomatous Polyposis Coli) 
gene. Although it was previously known that deletion of alleles on 
chromosome 5q were common in certain types of cancers, it was not 
known that a target gene of these deletions was the APC gene. Fur- 
ther it was not known that other types of mutational events in the APC 
gene are also associated with cancers. The mutations of the APC gene 
can involve gross rearrangements, such as insertions and deletions. 
Point mutations have also been observed. 

According to the diagnostic and prognostic method of the 
present invention, alteration of the wild-type APC gene is detected. 
"Alteration of a wild-type gene" according to the present invention 
encompasses all forms of mutations - including deletions. The alter- 
ation may be due to either rearrangements such as insertions, inver- 
sions, and deletions, or to point mutations. Deletions may be of the 
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entire gene or only a portion of the gene. Somatic mutations are those 
which occur only In certain tissues, e,g,. In the tumor tissue, and are 
not inherited in the germline, Germline mutations can be found in any 
of a body's tissues. If only a single allele is somatically mutated, an 
early neoplastic state is indicated. However, if both alleles are 
mutated. then a late neoplastic state is Indicated. The finding of APC 
mutations thus provides both diagnostic and prognostic information. 
An APC aUele which is not deleted (e.g., that on the sister chromosome 
to a chromosome carrying an APC deletion) can be screened for other 
mutations, such as insertions, small deletions, and point mutations. It 
is believed that many mutations found in tumor tissues will be those 
leading to decreased expression of the APC gene product. However, 
mutations leading to non-functional gene products would also lead to a 
cancerous state. Point mutational events may occur in regulatory 
regions, such as in the promoter of the gene, leading to loss or diminu- 
tion of expression of the mRNA. Point mutations may also abolish 
proper RNA processing, leading to loss of expression of the APC gene 
product. 

In order to detect the alteration of the wild-type APC gene in a 
tissue. It is helpful to isolate the tissue free from surrounding normal 
tissues. Means for enriching a tissue preparation for tumor cells are 
known in the art. For example, the tissue may be isolated from paraf- 
fin or cryostat sections. Cancer cells may also be separated from nor- 
mal cells by flow cytometry. These as well as other techniques for 
separating tumor from normal cells are well known in the art. If the 
tumor tissue is highly contaminated with normal cells, detection of 
mutations is more difficult. 

Detection of point mutations may be accomplished by molecular 
cloning of the APC allele (or alleles) and sequencing that allele{s) using 
techniques well known in the art. Alternatively, the polymerase chain 
reaction (PCR) can l5e used to amplify gene sequences directly from a 
genomic DMA preparation from the tumor tissue. The DNA sequence 
of the amplified sequences can then be determined. The polymerase 
chain reaction Itself Is well known in the art. See, e.g., Saiki et al.. 
Science, Vol. 239, p. 487, 1988; U.S. 4,683,203; and U.S. 4,683,195, 
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Specific primers which can be used in order to amplify the gene will 
be discussed in more detail below. The ligase chain reaction, which is 
known in the art, can also be used to amplify APC sequences. See Wu 
et al.. Genomics . Vol. 4, pp. 560-569 (1989). In addition, a technique 
known as allele specific PGR can be used. (See Ruano and Kidd, 
Nucleic Acids Research, Vol. 17, p. 8392, 1989.) According to this 
technique, primers are used which hybridize at their 3* ends to a par- 
ticular APC mutation. If the particular APC mutation is not present, 
an amplification product is not observed. Amplification Refractory 
Mutation System (ARMS) can also be used as disclosed in European 
Patent Application Publication No. 0332435 and in Newton et al.. 
Nucleic Acids Research, Vol. 17, p.7, 1989. Insertions and deletions of 
genes can also be detected by cloning, sequencing and amplification. In 
addition, restriction fragment length polymorphism (RFLP) probes for 
the gene or surrounding marker genes can be used to score alteration 
of an allele or an insertion in a polymorphic fragment. Such a method 
is particularly useful for screening among kindred persons of an 
affected individual for the presence of the APC mutation found In that 
individual. Single stranded conformation polymorphism (SSCP) analysis 
can also be used to detect base change variants of an allele. (Orita et 
al., Proc- NaU. Acad. Sci. USA Vol. 86, pp. 2766-2770, 1989, and 
Genomics, VoL 5, pp. 874-879, 1989.) Other techniques for detecting 
insertions and deletions as are known in the art can be used. 

Alteration of wild-type genes can also be detected on the basis 
of the alteration of a wild-type expression product of the gene. Such 
expression products include both the APC mRNA as well as the APC 
protein product. The sequences of these products are shown in 
Figures 3 and 7, Point mutations may he detected by amplifying and 
sequencing the mRNA or via molecular cloning of cDNA made from the 
mRNA, The sequence of the cloned cDNA can be determined using 
DNA sequencing techniques which are well known in the art. The 
cDNA can also be sequenced via the polymerase chain reaction (PCR) 
which will be discussed in more detail below. 

Mismatches, according to the present invention are hybridized 
nucleic acid duplexes which are not 100% homologous. The lack of 
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total homology may be due to deletions, Insertions, inversions, substitu- 
tions or frameshift mutations. Mismatch detection can be used to 
detect point mutations in the gene or its mRNA product. While these 
techniques are less sensitive than sequencing, they are simpler to per- 
form on a large number of tumor samples. An example of a mismatch 
cleavage technique is the RNase protection method, which is described 
in detail In Winter et al., Proc. Natl. Acad, Sci. USA, Vol. 82, p. 7575, 
1985 and Meyers et aL, Science, Vol. 230. p. 1242, 1985. In the practice 
of the present invention the method involves the use of a labeled 
riboprobe which is complementary to the human wild-type APC gene 
coding sequence. The riboprobe and either mRNA or DNA Isolated 
from the tumor tissue are annealed (hybridized) together and subse- 
quently digested with the enzyme RNase A which is able to detect 
some mismatches in a duplex RNA structure. If a mismatch is detected 
by RNase A, it cleaves at the site of the mismatch. Thus, when the 
annealed RNA preparation is separated on an electrophoretic gel 
matrix, if a mismatch has been detected and cleaved by RNase A, an 
RNA product will be seen which Is smaller than the full-length duplex 
RNA for the riboprobe and the mRNA or DNA. The riboprobe need not 
be the full length of the APC mRNA or gene but can be a segment of 
either. If the riboprobe comprises only a segment of the APC mRNA or 
gene it wUl be desirable to use a number of these probes to screen the 
whole mRNA sequence for mismatches. 

In similar fashion, DNA probes can be used to detect mis- 
matches, through enzymatic or chemical cleavage. See, e.g.. Cotton et 
al., Proc. Natl. Acad. Sci. USA, Vol. 85, 4397, 1988; and Shenk et al., 
Proc. Natl. Acad. Sci. USA, Vol. 72, p. 989, 1975. Alternatively, mis- 
matches can be detected by shifts in the electrophoretic mobility of 
mismatched duplexes relative to matched duplexes. See, e.g., Carlello, 
Human Genetics, Vol. 42, p. 726, 1988. With either rlboprobes or DNA 
probes, the cellular mRNA or DNA which might contain a mutation can 
be amplified using PCR (see below) before hybridization. Changes in 
DNA of the APC gene can also be detected using Southern hybridiza- 
tion, especially il the changes are gross rearrangements, such as dele- 
tions and insertions. 
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DNA sequences of the A PC gene which have been amplified by 
use of polymerase chain reaction may also be screened using allele-spe- 
cific probes. These probes are nucleic acid oligomers, each of which 
contains a region of the A PC gene sequence harboring a known muta- 
tion. For example, one oligomer may be about 30 nucleotides in length, 
corresponding to a portion of the A PC gene sequence. By use of a bat- 
tery of such allele-specific probes, PCR amplification products can be 
screened to identify the presence of a previously identified mutation in 
the A PC gene. Hybridization of allele-specific probes with amplified 
APC sequences can be performed, for example, on a nylon fUter. 
Hybridization to a particular probe under stringent hybridization condi- 
tions Indicates the presence of the same mutation in the tumor tissue 
as in the allele-specific probe. 

Alteration of APC mRNA expression can be detected by any 
technique known in the aru These include Northern blot analysis, PCR 
amplification and RNase protection. Diminished mRNA expression 
indicates an alteration of the wild-type APC gene. 

Alteration of wild-type APC genes can also be detected by 
screening for alteration of wild-type APC protein. For example, 
monoclonal antibodies immunoreactive with APC can be used to screen 
a tissue. Lack of cognate antigen would indicate an APC mutation. 
Antibodies specific for products of mutant aHeles could also be used to 
detect mutant APC gene product. Such immunological assays can be 
done in any convenient format known in the art. These include West- 
ern blots, immunohistochemical assays and EUSA assays. Any means 
for detecting an altered APC protein can be used to detect alteration 
of wild-type APC genes. Functional assays can be used, such as protein 
binding determinations. For example, it is believed that APC protein 
oligomerizes to itself and/or MCC protein or binds to a G protein. 
Thus, an assay for the ability to bind to wild type APC or MCC protein 
or that G protein can be employed. In addition, assays can be used 
which detect APC biochemical function. It Is believed that APC Is 
involved in phospholipid metabolism. Thus, assaying the enzymatic 
products of the involved phospholipid metabolic pathway can be used to 
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determine A PC activity. Finding a mutant A PC gene product indicates 
alteration of a wild-type APC gene. 

Mutant APC genes or gene products can also be detected in 
other human body samples, such as, serum, stool, urine and sputum. 
The same techniques discussed above for detection of mutant APC 
genes or gene products in tissues can be applied to other body samples. 
Cancer cells are sloughed off from tumors and appear in such body 
samples. In addition, the APC gene product itself may be secreted into 
the extracellular space and found in these body samples even in the 
absence of cancer cells. By screening such body samples, a simple 
early diagnosis can be achieved for many types of cancers. In addition, 
the progress of chemotherapy or radiotherapy can be monitored more 
easily by testing such body samples for mutant APC genes or gene 
products. 

The methods of diagnosis of the present invention are applicable 
to any tumor in which APC has a role in tumorigenesis. Deletions of 
chromosome arm 5q have been observed in tumors of lung, breast, 
colon, rectum, bladder, liver, sarcomas, stomach and prostate, as well 
as in leukemlas and lymphomas. Thus these are likely to be tumors in 
which APC has a role. The diagnostic method of the present invention 
Is useful for clinicians so that they can decide upon an appropriate 
course of treatment. For example, a tumor displaying alteration of 
both APC alleles might suggest a more aggressive therapeutic regimen 
than a tumor displaying alteration of only one APC allele. 

The primer pairs of the present invention are useful for determi- 
nation of the nucleotide sequence of a particular APC allele using the 
polymerase chain reaction. The padrs of single stranded DNA primers 
can be annealed to sequences within or surrounding the APC gene on 
chromosome 5q in order to prime amplifying DNA synthesis of the APC 
gene itself, A complete set of these primers allows synthesis of aU of 
the nucleotides of the APC gene coding sequences, i.e., the exons. The 
set of primers preferably allows synthesis of both intron and exon 
sequences- Allele specific primers can also be used. Such primers 
anneal only to particular APC mutant alleles, and thus will only amplify 
a product in the presence of the mutant allele as a template. 
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In order to facilitate subsequent cloning of amplified sequences, 
primers may have restriction enzyme site sequences appended to their 
5' ends. Thus, all nucleotides of the primers are derived from A PC 
sequences or sequences adjacent to APC except the few nucleotides 
necessary to form a restriction enzyme site. Such enzymes and sites 
are well known In the art. The primers themselves can be synthesized 
using techniques which are well known in the art. Generally, the prim- 
ers can be made using oligonucleotide synthesizing machines which are 
commercially available. Given the sequence of the APC open reading 
frame shown in Figure 7, design of particular primers is well within the 
skill of the art. 

The nucleic acid probes provided by the present invention are 
useful for a number of purposes. They can be used in Southern hybrid- 
ization to genomic DMA and in the RNase protection method for 
detecting point mutations already discussed above. The probes can be 
used to detect PGR amplification products. They may also be used to 
detect mismatches with the APC gene or mRNA using other tech- 
niques. Mismatches can be detected using either enzymes (e.g.. Si 
nuclease), chemicals (e,g., hydroxylamine or osmium tetroxide and 
piperidine), or changes in electrophoretic mobility of mismatched 
hybrids as compared to totally matched hybrids. These techniques are 
known in the art. See, Cotton, supra . Shenk, supra , Myers, supra . Win- 
ter, supra , and Novack et al., Proc. Natl. Acad. Sci. USA, Vol. 83, p. 
586, 1986. Generally, the probes are complementary to APC gene cod- 
ing sequences, although probes to certain introns are also contem- 
plated. An entire battery of nucleic acid probes is used to compose a 
kit for detecting alteration of wild- type APC genes. The kit allows for 
hybridization to the entire APC gene. The probes may overlap with 
each other or be contiguoiis. 

If a riboprobe is used to detect mismatches with mRNA, it is 
complementary to the mRNA of the human wild-type APC gene. The 
riboprobe thus Is an anti-sense probe in that it does not code for the 
APC protein because it is of the opposite polarity to the sense strand. 
The riboprobe generally will be laljeled with a radioactive, 
colorimetric, or fluorometric material, which can be accomplished by 
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any means known In the art. If the riboprobe Is used to detect mis- 
matches with DNA it can be of either polarity, sense or anti-sense. 
Similarly, DNA probes also may be used to detect mismatches. 

Nucleic acid probes may also be complementary to mutant 
alleles of the APC gene. These are useful to detect similar mutations 
in other patients on the basis of hybridization rather than mismatches. 
These are discussed above and referred to as allele-specif ic probes. As 
mentioned above, the APC probes can also be used in Southern hybrid- 
izations to genomic DNA to detect gross chromosomal changes such as 
deletions and insertions. The probes can also be used to select cDNA 
clones of APC genes from tumor and normal tissues. In addition, the 
prot)es can be used to detect APC mRNA in tissues to determine if 
expression is diminished as a result of alteration of wild-type APC 
genes. Provided with the APC coding sequence shown in Figure 7 (SEQ 
ID NO: 1), design of particular probes is well within the skill of the 
ordinary artisan. 

According to the present invention a method is also provided of 
supplying wild-type APC function to a cell which carries mutant APC 
alleles. Supplying such function should suppress neoplastic growth of 
the recipient cells. The wild-type APC gene or a part of the gene may 
be introduced into the cell in a vector such that the gene remains 
extrachromosomal. In such a situation the gene will be expressed by 
the cell from the extrachromosomal location. If a gene portion is 
introduced and expressed in a cell carrying a mutant APC allele, the 
gene portion should encode a part of the APC protein which is required 
for non-neoplastic growth of the cell. More preferred is the situation 
where the wild- type APC gene or a part of it is introduced into the 
mutant cell in such a way that it recombines with the endogenous 
mutant APC gene present in the cell. Such recombination requires a 
double recombination event which results in the correction of the APC 
gene mutation. Vectors for introduction of genes both for recombina- 
tion and for extrachromosomal maintenance are known in the art and 
any suitable vector may be used. Methods for introducing DNA into 
cells such as electroporation, calcium phosphate co-precipitation and 
viral transduction are known in the art and the choice of method is 
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within the competence of the routineer. Cells transformed with the 
wild- type A PC gene can be used as model systems to study cancer 
remission and drug treatments which promote such remission. 

Similarly, cells and animals which carry a mutant A PC allele can 
be used as model systems to study and test for substances which have 
potential as therapeutic agents. The cells are typically cultured 
epithelial cells. These may be isolated from individuals with A PC 
mutations, either somatic or germline. Alternatively, the cell line can 
be engineered to carry the mutation in the APC allele. After a test 
substance Is applied to the ceUs, the neoplastically transformed pheno- 
type of the cell will be determined. Any trait of neoplastically trans- 
formed cells can be assessed, including anchorage-independent growth, 
tumorigenicity in nude mice, invasiveness of cells, and growth factor 
dependence. Assays for each of these traits are known in the art. 

Animals for testing therapeutic agents can be selected after 
mutagenesis of whole animals or after treatment of germline cells or 
zygotes. Such treatments include insertion of mutant APC alleles, usu- 
ally from a second animal species, as well as insertion of disrupted 
homologous genes. Alternatively, the endogenous APC gene(s) of the 
animals may be disrupted by Insertion or deletion mutation. After test 
substances have been administered to the animals, the growth of 
tumors must be assessed. If the test substance prevents or suppresses 
the growth of tumors, then the test substance is a candidate therapeu- 
tic agent for the treatment of FAP and/or sporadic cancers. 

Polypeptides which have APC activity can be supplied to cells 
which carry mutant or missing APC alleles. The sequence of the APC 
protein is disclosed in Figure 3 or 7 (SEQ ID NO: 7 or 1). These two 
sequences differ slightly and appear to be indicate the existence of two 
different forms of the APC protein. Protein can be produced by 
expression of the cDNA sequence in bacteria, for example, using known 
expression vectors. Alternatively. APC can be extracted from APC- 
producing mammalian cells such as brain cells. In addition, the tech- 
niques of synthetic chemistry can be employed to synthesize APC pro- 
tein. Any of such techniques can provide the preparation of the 
present invention which comprises the APC protein. The preparation 
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is substantially free of other human proteins. This is most readily 
accomplished by synthesis in a microorganism or in vitro . 

Active APC molecules can be introduced into cells by 
microinjection or by use of liposomes, for example. Alternatively, 
some such active molecules may be taken up by cells, actively or by 
diffusion. Extracellular application of APC gene product may be suffi- 
cient to affect tumor growth. Supply of molecules with APC activity 
should lead to a partial reversal of the neoplastic state. Other mole- 
cules with APC activity may also be used to effect such a reversal, for 
example peptides, drugs, or organic compounds. 

The present invention also provides a preparation of antibodies 
immunoreactive with a human APC protein. The antibodies may be 
polyclonal or monoclonal and may be raised against native APC pro- 
tein, APC fusion proteins, or mutant APC proteins. The antibodies 
should be immunoreactive with APC epitopes, preferably epitopes not 
present on other human proteins. In a preferred emlxxliment of the 
invention the antibodies will immunoprecipitate APC proteins from 
solution as well as react with APC protein on Western or immunoblots 
of polyacrylamide gels. In another preferred embodiment, the antibod- 
ies will detect APC proteins In paraffin or frozen tissue sections, using 
immunocytochemical techniques. Techniques for raising and purifying 
antibodies are well known in the art and any such techniques may be 
chosen to achieve the preparation of the invention. 

Predisposition to cancers as in FAP and GS can be ascertained 
Ijy testing any tissue of a human for mutations of the APC gene. For 
example, a person who has inherited a germline APC mutation would be 
prone to develop cancers. This can be determined by testing DNA from 
any tissue of the person's body. Most simply, blood can be drawn and 
DNA extracted from the cells of the blood. In addition, prenatal diag- 
nosis can be accomplished by testing fetal cells, placental cells, or 
amniotic fluid for mutations of the APC gene. Alteration of a wild- 
type APC allele, whether for example, by point mutation or by dele- 
tion, can be detected by any of the means discussed above. 

Molecules of cDNA according to the present invention are 
intron-free, APC gene coding molecules. They can be made by reverse 
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transcriptase using the APC mRNA as a template. These molecules 
can be propagated in vectors and cell lines as is known in the art. Such 
molecules have the sequence shown in SEQ ID NO: 7. The cDNA can 
also be made using the techniques of synthetic chemistry given the 
sequence disclosed herein. 

A short region of homology has been identifiied between APC and 
the human m3 muscarinic acetylcholine receptor (mAChR). This 
homology was largely confined to 29 residues In which 6 out of 7 amino 
acids (EL(GorA)GLQA) were identical (See Figure 4), Initially, it was 
not known whether this homology was significant, because many other 
proteins had higher levels of global homology (though few had six out of 
seven contiguous amino acids in common). However, a study on the 
sequence elements controlling G protein activation by mAChR subtypes 
(Lechleiter et al., EMBO J., p. 4381 (1990)) has shown that a 21 amino 
acid region from the m3 mAChR completely mediated G protein speci- 
ficity when substituted for the 21 amino acids of m2 mAChR at the 
analogous protein position. These 21 residues overlap the 19 amino acid 
homology between APC and m3 mAChR. 

This connection between APC and the G protein activating 
region of mAChR is intriguing in light of previous Investigations relat- 
ing G proteins to cancer. For example, the RAS oncogenes, which are 
often mutated in colorectal cancers (Vogelstein, et al., N. Engl. j. 
Med., Vol. 319, p. 525 (1988); Bos et al.. Nature Vol. 327, p. 293 (1987)), 
are members of the G protein family (Bourne, et al.. Nature, Vol. 348, 
p. 125 (1990)) as is an in vitro transformation suppressor (Noda et al., 
Proc. Natl. Acad. Sci. USA, Vol. 86, p. 162 (1989)) and genes mutated in 
hormone producing tumors (Candls et al.. Nature, Vol. 340, p. 692 
(1989); Lyons et al.. Science, Vol. 249, p. 655 (1990)). Additionally, the 
gene responsible for neurofibromatosis (presumably a tumor suppressor 
gene) has been shown to activate the GTPase activity of RAS (Xu et al.. 
Cell, Vol. 63, p. 835 (1990); Martin et al., Cell, Vol. 63, p. 843 (1990); 
Ballester et al., CeU, Vol. 63, p. 851 (1990)). Another interesting Unk 
between G proteins and colon cancer involves the drug sulindac. This 
agent has been shown to inhibit the growth of benign colon tumors in 
patients with FAP, presumably by virtue of its activity as a 
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cyclooxygenase inhibitor (Waddell et al., J. Surg. Oncology 24(1), 83 
(1983); WadeU, et al.. Am, J. Surg., 157(1), 175 (1989); Charneau et al., 
Gastroenterologie Clinique at Biologique 14(2), 153 (1990)). 
Cyclooxygenase is required to convert arachidonic acid to 
prostaglandins and other biologically active molecules. G proteins are 
known to regulate phospholipase A2 activity, which generates 
arachidonic acid from phospholipids (Role et aL, Proc. NatL Acad. Scl. 
USA, Vol. 84, p. 3623 (1987); Kurachi et al.. Nature, Vol. 337, 12 555 
(1989)). Therefore we propose that wild-type APC protein functions by 
interacting with a G protein and is involved in phospholipid 
metabolism. 

The following are provided for exemplification purposes only and 
are not intended to limit the scope of the invention which has been 
described in broad terms above. 
Example l ; 

This example demonstrates the Isolation of a 5.5 Mb region of 
human DNA linked to the FAP locus. Six genes are identified in this 
region, all of which are expressed in normal colon cells and in 
colorectal, lung, ad bladder tumors. 

The cosmid markers YN5.64 and YN5.48 have previously been 
shown to delimit an 8 cM region containing the locus for FAP 
(Nakamura et al.. Am. J. Hum. Genet. Vol. 43, p. 638 (1988)). Further 
linkage and pulse-field gel electrophoresis (PFGE) analysis with addi- 
tional markers has shown that the FAP locus is contained within a 4 cM 
region bordered by cosmids EF5.44 and L5.99. In order to isolate clones 
representing a significant portion of this locus, a yeast artificial chro- 
mosome (YAC) library was screened with various 5q21 markers. 
Twenty-one YAC clones, distributed within six contigs and including 
5.5 Mb from the region Ijetween YN5.64 and YN5.48, were obtained 
(Figure lA). 

Three contigs encompassing approximately 4Mb were contained 
within the central portion of this region. The YAC*s constituting these 
contigs, together with the markers used for their isolation and orienta- 
tions, are shown in Figure 1. These YAC contigs were obtained in the 
following way. To initiate each contig, the sequence of a genomic 
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marker cloned from chromosome 5q2l wats determined and used to 
design primers for PGR, PGR was then carried out on pools of YAG 
clones distributed In microliter trays as previously described (Anand 
et alM Nucleic Acids Research, Vol. 18, p- 1951 (1980)), Individual YAG 
clones from the positive pools were identified by further PGR or 
hybridization based assays, and the YAG sizes were determined by 
PFGE. 

To extend the areas covered by the original YAG clones, "chro- 
mosomal walking" was performed. For this purpose, YAG termini were 
isolated by a PGR based method and sequenced (Riley et al., Nucleic 
Acids Research, Vol. 18, p. 2887 (1990)). PGR primers based on these 
sequences were then used to rescreen the YAG library. For example, 
the sequence from an Intron of the FER gene (Hao et aL, Mol. Gell. 
Biol., Vol. 9, p. 1587 (1989)) was used to design PGR primers for isola- 
tion of the 28EG1 and 5EH8 YAGs. The termini of the 28EG1 YAG 
were sequenced to derive markers RHE28 and LHE28, respectively. 
The sequences of these two markers were then used to isolate YAG 
clones 15GH12 (from RHE28) and 40GF1 and 29EF1 (from LHE28). 
These five YAC's formed a con tig encompassing 1200 kb (contig l, 
Figure IB). 

Similarly, contig 2 was initiated using cosmid N5.66 sequences, 
and contig 3 was initiated using sequences both from the MGG gene and 
from cosmid EF5.44, A walk in the telomeric direction from YAG 
14FH1 and a walk in the opposite direction from YAG 39GG3 aUowed 
connection of the initial contig 3 clones through YAG 37HG4 
(Figure IB). 

Multipoint linkage analysis with the various markers used to 
define the contigs, combined with PFGE analysis, showed that contigs 1 
and 2 were centromerlc to contig 3. These contigs were used as tools 
to orient and/or identify genes which might be responsible for FAP. 
Six genes were found to lie within this cluster of YAG*s, as follows: 

Gontig #1: FER - The FER gene was discovered through its 
homology to the viral oncogene ABL (Hao et al., supra ). It has an 
intrinsic tyrosine kinase activity, and in situ hybridization with an FER 
probe showed that the gene was located at 5q 11-23 (Morris et aL, 
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Cytogenet. Cell. Genet., VoL 53, p. 4, (1990)). Because of the potential 
role of this oncogene-related gene in neoplasia, we decided to evaluate 
it further with regards to the FAP locus. A human genomic clone from 
FER was Isolated (MF 2.3) and used to define a restriction fragment 
length polymorphism (RFLP), and the RFLP in turn used to map FER by 
linkage analysis using a panel of three generation families. This 
showed that FER was very tightly linked to previously defined 
polymorphic markers for the FAP locus. The genetic mapping of FER 
was complemented by physical mapping using the YAC clones derived 
from FER sequences (Figure IB). Analysis of YAC contig 1 showed that 
FER was within 600 kb of cosmid marker M5.28, which maps to within 
1.5 Mb of cosmid L5.99 by PFGE of human genomic DNA. Thus, the 
YAC mapping results were consistent with the FER linkage data and 
PFGE analyses. 

Contig 2: TBI - TBI was identified through a cross-hybridization 
approach. Exons of genes are often evolutionarily conserved while 
introns and intergenic regions are much less conserved. Thus, if a 
human probe cross-hybridizes strongly to the DNA from non-primate 
species, there is a reasonable chance that it contains exon sequences. 
Subclones of the cosmids shown in Figure 1 were used to screen South- 
ern blots containing rodent DNA samples. A subclone of cosmid N5.66 
(p 5 •66-4) was shown to strongly hybridize to rodent DNA, and this 
clone was used to screen cDNA libraries derived from normal adult 
colon and fetal liver. The ends of the initial cDNA clones obtained in 
this screen were then used to extend the cDNA sequence. Eventually, 
11 cDNA clones were isolated, covering 2314 bp. The gene detected by 
these clones was named TBI, Sequence analysis of the overlapping 
clones revealed an open reading frame (ORF) that extended for 1302 bp 
starting from the most 5* sequence data obtained (Figure 2A). If this 
entire open reading frame were translated, it would encode 434 amino 
acids. The product of this gene was not globally homologous to any 
other sequence in the current database but showed two significant local 
similarities to a family of ADP, ATP carrier/translocator proteins and 
mitochondrial brown fat uncoupling proteins which are widely distrib- 
uted from yeast to mammals. These conserved regions of TBI 
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(underlined In Figure 2A) may define a predictive motif for this 
sequence family. In addition, TBI appeared to contain a signal peptide 
(or mitochondrial targeting sequence) as well as at least 7 
transmembrane domains. 

Contig 3: MCC, TB2, SRP and APC - The MCC gene was also 
discovered through a cross-hybridization approach, as described previ- 
ously (Kinzler et al.. Science Vol. 251, p. 1366 (1991)). The MCC gene 
was considered a candidate for causing FAP by virtue of its tight 
genetic linkage to FAP susceptibility and its somatic mutation in spo- 
radic colorectal carcinomas. However, mapping experiments suggested 
that the coding region of MCC was approximately 50 kb proximal to 
the centromerlc end of a 200 kb deletion found in an FAP patient. 
MCC cDNA probes detected a 10 kb mRNA transcript on Northern blot 
analysis of which 4151 bp, including the entire open reading frame, 
have been cloned. Although the 3' non-translated portion or an alter- 
natively spliced form of MCC might have extended into this deletion, it 
was possible that the deletion did not affect the MCC gene product. 
We therefore used MCC sequences to Initiate a YAC contig, and subse- 
quently used the YAC clones to identify genes 50 to 250 kb distal to 
MCC that might be contained within the deletion. 

In a first approach, the insert from YAC24ED6 (Figure IB) was 
radiolabelled and hybridized to a cDNA library from normal colon. One 
of the cDNA clones (YS39) identified In this manner detected a 3.1 kb 
mRNA transcript when used as a probe for Northern blot hybridization. 
Sequence analysis of the YS39 clone revealed that it encompassed 2283 
nucleotides and contained an ORF that extended for 555 bp from the 
most 5' sequence data obtained. If all of this ORF were translated, it 
would encode 185 amino acids (Figure 23). The gene detected by YS39 
was named TB2. Searches of nucleotide and protein databases revealed 
that the TB2 gene was not identical to any previously reported 
sequences nor were there any striking similarities. 

Another clone (YSll) identified through the YAC 24ED6 screen 
appeared to contain portions of two distinct genes. Sequences from 
one end of YSll were identical to at least 180 bp of the signal recogni- 
tion particle protein SRP19 (Lingelbach et al. Nucleic Acids Research, 
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Vol. 16, p. 9431 (1988). A second ORF, from the opposite end of clone 
YSll, proved to be identical to 78 bp of a novel gene which was inde- 
pendently identified through a second YAC-based approach. For the 
latter, DNA from yeast cells containing YAC 14FH1 (Figure IB) was 
digested with EcoRI and subcloned into a plasmid vector. Plasmids that 
contained human DNA fragments were selected by colony hybridization 
using total human DNA as a probe. These clones were then used to 
search for cross-hybridizing sequences as described atove for TBI, and 
the cross-hybridizing clones were subsequently used to screen cDNA 
libraries. One of the cDNA clones discovered in this way (FH38) con- 
tained a long ORF (2496 bp), 78 bp of which were identical to the 
above-noted sequences in YSll. The ends of the FH38 cDNA clone 
were then used to Initiate cDNA walking to extend the sequence. 
Eventually, 85 cDNA clones were isolated from normal colon, brain and 
liver cDNA libraries and found to encompass 8973 nucleotides of con- 
tiguous transcript. The gene corresponding to this transcript was 
named APC. When used as probes for Northern blot analysis, APC 
cDNA clones hybridized to a single transcript of approximately 9.5 kb, 
sureesting that the great majority of the gene product was represented 
in the cDNA clones obtained. Sequences from the 5' end of the APC 
gene were found in YAC 37HG4 but not in YAC 14FH1. However, the 
3' end of the APC gene was found in 14FH1 as weU as 37HG4. The 
yeast artificial chromosome of the present invention designated 
YAC 37HG4 has been deposited with the National Collection of Indus- 
trial and Marine Bacteria (NCIMB), P,0. Box 31, 135 Abbey Road, 
Aberdeen AB9 8DG, Scotland, prior to the filing of this patent applica- 
tion. The NCIMB Accession Number of YAC clone YAC 37HG4 is 
40353. Analogously, the 5' end of the MCC coding region was found in 
YAC clones 19AA9 and 26GC3 but not 24ED6 or 14FHi, while the 3' 
end displayed the opposite pattern. Thus, MCC and APC transcription 
units pointed in opposite directions, with the direction of transcription 
going from centromeric to telomeric in the case of MCC, and telomeric 
to centromeric in the case of APC, PFGE analysis of YAC DNA 
digested with various restriction endonucleases showed that TB2 and 
SRP were between MCC and APC, and that the 3' ends of the coding 
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regions of MCC and APC were separated by approximately 150 kb 
(Figure IB). 

Sequence analysis of the APC cDNA clones revealed an open 
reading frame of 8,535 nucleotides. The 5' end of the ORF contained a 
methionine codon (codon 1) that was preceded by an in-frame stop 
codon 9 bp upstream, and the 3* end was followed by several in-frame 
stop codons. The protein produced by initiation at codon 1 would con- 
tain 2,842 amino acids (Figure 3). The results of database searching 
with the APC gene product were quite complex due to the presence of 
large segments with locally biased amino acid compositions. In spite of 
this, APC could be roughly divided into two domains. The N-terminal 
25% of the protein had a high content of leucine residues (12%) and 
showed local sequence similarities to myosins, various Intermediate 
filament proteins (e.g., desmln. vimentin, neurofilaments) and 
Drosophila armadillo/human plakoglobln. The latter protein is a com- 
ponent of adhesive Junctions (desmosomes) Joining epithelial cells 
(Franke et al., Proc^ Natl. Acad. Sci. U.S.A., Vol. 86, p. 4027 (1989); 
Perfer et al.. Cell, Vol. 63, p. 1167 (1990)) The C-terminal 75% of APC 
(residues 731-2832) is 17% serine by composition with serine residues 
more or less uniformly distributed. This large domain also contains 
local concentrations of charged (mostly acidic) and proline residues. 
There was no indication of potential signal peptides, transmembrane 
regions, or nuclear targeting signals in APC, suggesting a cytoplasmic 
localization. 

To detect short similarities to APC, a database search was per- 
formed using the PAM-40 matrix (Altschul. J. Mol. Bio., Vol. 219, p. 555 
(1991)^ Potentially interesting matches to several proteins were found. 
The most suggestive of these involved the ral2 gene product of yeast, 
which is implicated in the regulation of ras activity (Fukul et al., Mol. 
Cell. Biol., Vol. 9, p. 5617 (1989)). Little is known about how ral2 might 
interact with ras but it is interesting to note the positively-charged 
character of this region in the context of the negatively-charged GAP 
interaction region of ras. A specific electrostatic interaction between 
ras and GAP-related proteins has been proposed. 
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Because of the proximity of the MCC and APC genes, and the 
fact that both are Implicated in colorectal tumorigenesis, we searched 
for similarities between the two predicted proteins. Bourne has previ- 
ously noted that MCC has the potential to form alpha helical coiled 
colls (Nature, Vol. 351, p. 188 (1991). Lupas and colleagues have 
recently developed a program for predicting coiled coil potential from 
primary sequence data (Science, Vol. 252, p. 1162 (1991) and we have 
used their program to analyze both MCC and APC. Analysis of MCC 
Indicated a discontinuous pattern of coiled-coil domains separated by 
putative "hinge" or "spacer" regions similar to those seen in laminin 
and other intermediate filament proteins. Analysis of the APC 
sequence revealed two regions in the N- terminal domain which had 
strong coiled coil-forming potential, and these regions corresponded to 
those that showed local similarities with myosin and IF proteins on 
database searching. In addition, one other putative coiled coil region 
was identified in the central region of APC. The potential for both 
APC and MCC to form coiled coils is interesting in that such structures 
often mediate homo- and hetero-oligomerization. 

Finally, it had previously been noted that MCC shared a short 
similarity with the region of the m3 muscarinic acetylcholine receptor 
(mAChR) known to regulate specificity of G-proteln coupling. The 
APC gene also contained a local similarity to the region of the m3 
mAChR that overlapped with the MCC similarity (Figure 4B). Although 
the similarities to ral2 (Figure 4A) and m3 mAChR (Figure 4B) were not 
statistically significant, they were intriguing in light of previous ot)ser- 
vatlons relating G-proteins to neoplasia. 

Each of the six genes described above was expressed in normal 
colon mucosa, as indicated by their representation in colon cDNA 
libraries* To study expression of the genes in neoplastic colorectal 
epithelium, we employed reverse transcription-polymerase chain reac- 
tion (PCR) assays. Primers based on the sequences of FER, TBI, TB2, 
MCC, and APC were each used to design primers for PCR performed 
with cDNA templates. Each of these genes was found to be expressed 
in normal colon, in each of ten cell lines derived from colorectal can- 
cers, and in tumor cell lines derived from lung and bladder tumors. The 
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ten colorectal cancer cell lines Included eight from patients with spo- 
radic CRC and two from patients with FAP. 
Example 2 

This example demonstrates a genetic analysis of the role of the 
FER gene in FAP and sporadic colorectal cancers. 

We considered FER as a candidate because of its proximity to 
the FAP locus as judged by physical and genetic criteria (see 
Example 1), and its homology to known tyrosine kinases with oncogenic 
potential- Primers were designed to PCR-amplify the complete coding 
sequence of FER from the RNA of two colorectal cancer cell lines 
derived from FAP patients. cDNA was generated from RNA and used 
as a template for PGR, The primers used were 
5*-AGAAGGATCCCTTGTGCAGTGTGGA-3' and 
5'-GACAGGATCCTGAAGCTGAGTTTG-3\ The underlined nucleotides 
were altered from the true FER sequence to create BamHI sites. The 
cell lines used were JW and Difi, Ixjth derived from colorectal cancers 
of FAP patients. (C. Paraskeva. B.G. Buckle, D. Sheer, C,B. Wigley, 
Int, J. Cancer 34, 49 (1984); M.E. Gross et al.. Cancer Res, 51, 1452 
(1991). The resultant 2554 basepair fragments were cloned and 
sequenced in their entirety. The PCR products were cloned in the 
BamHI site of Bluescript SK (Stratagene) and pools of at least 50 clones 
were sequenced en masse using T7 polymerase, as described in Nigro 
et al.. Nature 342, 705 (1989). 

Only a single conservative amino acid change (GTG->CTG, cre- 
ating a val to leu substitution at codon 439) was observed. The region 
surrounding this codon was then amplified from the DNA of individuals 
without FAP and this substitution was found to be a common 
polymorphism, not specifically associated with FAP. Based on these 
results, we considered it unlikely (though still possible) the FER gene 
was responsible for FAP. To amplify the regions surrounding codon 
439, the following primers were used: 5'-TCAGAAAGTGCTGAAGAG-3' 
and 5*-GGAATAATTAGGTCTCCAA-3'. PCR products were digested 
with PstI, which yields a 50 bp fragment If codon 439 is leucine, but 26 
and 24 bp fragments if it is valine. The primers used for sequencing 
were chosen from the FER cDNA sequence in Hao et al., supra . 
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Example 3 

This example demonstrates the genetic analysis of MCC, TB2, 
SRP and APC in FAP and sporadic colorectal tumors. Each of these 
genes is linked and encompassed by contig 3 (see Figure 1). 

Several lines of evidence suggested that this contig was of par- 
ticular interest. First, at least three of the four genes in this contig 
were within the deleted region identified in two FAP patients, (See 
Example 5 infra .) Second, allelic deletions of chromosome 5q21 in spo- 
radic cancers appeared to be centered in this region. (Ashton-Rickardt 
et aL, Oncogene, in press; and Miki et al., Japn, J, Cancer Res,, in 
press.) Some tumors exhibited loss of proximal RFLP markers (up to 
and potentially including the 5* end of MCC), but no loss of markers 
distal to MCC, Other tumors exhibited loss of markers distal to and 
perhaps including the 3' end of MCC, but no loss of sequences proximal 
to MCC. This suggested either that different ends of MCC were 
affected by loss in all such cases, or alternatively, that two genes (one 
proximal to and perhaps including MCC, the other distal to MCC) were 
separate targets of deletion. Third, clones from each of the six FAP 
region genes were used as probes on Southern blots containing tumor 
DNA from patients with sporadic CRC, Only two examples of somatic 
changes were observed In over 200 tumors studied: a 
rearrangement/deletion whose centromeric end was located within the 
MCC gene (Kinzler et al,, supra ) and an 800 bp insertion within the 
APC gene between nucleotides 4424 and 5584. Fourth, point mutations 
of MCC were observed in two tumors (Kinzler et al.) supra strongly 
suggesting that MCC was a target of mutation In at least some sporadic 
colorectal cancers. 

Based on these results, we attempted to search for subtle alter- 
ations of contig 3 genes in patients with FAP, We chose to examine 
MCC and APC, rather than TB2 or SRP, because of the somatic muta- 
tions in MCC and APC noted above. To facilitate the identification of 
subtle alterations, the genomic sequences of MCC and APC exons were 
determined (see Table I), These sequences were used to design primers 
for PCR analysis of constitutional DNA from FAP patients. 
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We first amplified eight exons and surrounding introns of the 
MCC gene in affected individuals from 90 different FAP kindreds. The 
PGR products were analyzed by a ribonuclease (RNase) protein assay. 
In brief, the PGR products were hybridized to in vitro transcribed RNA 
probes representing the normal genomic sequences. The hybrids were 
digested with RNase A, which can cleave at single base pair mis- 
matches within DNA-RNA hybrids, and the cleavage products were 
visualized foUowing denaturing gel electrophoresis. Two separate 
RNase protection analyses were performed for each exon, one with the 
sense and one with the antisense strand. Under these conditions, 
approximately 40% of all mismatches are detectable. Although some 
amino acid variants of MCG were observed in FAP patients, aU such 
variants were found in a smaU percentage of normal individuals. These 
variants were thus unlikely to be responsible for the inheritance of 
FAP. 

We next examined three exons of the APG gene. The three 
exons examined included those containing nt 822-930, 931-1309, and 
the first 300 nt of the most distal exon (nt 1956-2256). PGR and RNase 
protection analysis were performed as described in Kinzler et al. supra , 
using the primers underlined in Table L The primers for nt 1956-2256 
were 5'-GGAAATGGTAAGAGAGAACAA-3' and 

S'-GATGGGAAGGTTGAGGGAG-3'. 

In 90 kindreds, the RNase protection method was used to screen 
for mutations and in an additional 13 kindreds, the PGR products were 
cloned and sequenced to search for mutations not detectable by RNase 
protection. PGR products were cloned into a Bluescript vector modi- 
fied as descritDed in T.A, Hoi ton and M.W. Graham, Nucleic Acids Res. 
19, 1156 (1991). A minimum of 100 clones were pooled and sequenced. 
Five variants were detected among the 103 kindreds analyzed, Gloning 
and subsequent DNA sequencing of the PGR product of patient P2l 
indicated a G to T transition in codon 413 that resulted in a change 
from arginine to cysteine. This amino acid variant was not observed in 
any of 200 DNA samples from individuals without FAP. Gloning and 
sequencing of the PGR product from patients P24 and P34, who demon- 
strated the same abnormal RNase protection pattern Indicated that 
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both had a C to T transition at codon 301 that resulted in a change 
from arginine (CGA) to a stop codon (TGA), This change was not 
present in 200 individuals without FAP, As this point mutation resulted 
in the predicted loss of the recognition site for the enzyme Taq I, 
appropriate PGR products could be digested with Taq I to detect the 
mutation. This allowed us to determine that the stop codon 
co-segregated with disease phenotype in members of the family of P24. 
The inheritance of this change in affected members of the pedigree 
provides additional evidence for the importance of the mutation. 

Cloning and sequencing of the PGR product from FAP patient 
P93 indicated a G to G transversion at codon 279, also resulting in a 
stop codon (change from TGA to TGA). This mutation was not present 
in 200 individuals without FAP. Finally, one additional mutation result- 
ing in a serine (TGA) to stop codon (TGA) at codon 712 was detected in 
a single patient with FAP (patient P60). 

The five germline mutations identified are summarized in 
Table UA, as well as four others discussed in Example 9. In addition to 
these germline mutations, we identified several somatic mutations of 
MCG and APG in sporadic CRG's. Seventeen MGG exons were exam- 
ined in 90 sporadic colorectal cancers by RNase protection analysis. In 
each case where an abnormal RNase protection pattern was observed, 
the corresponding PGR products were cloned and sequenced. This led 
to the identification of six point mutations (two described previously) 
(Kinzler et al., supra ), each of which was not found in the germline of 
these patients (Table IIB). Four of the mutations resulted in amino acid 
substitutions and two resulted in the alteration of splice site consensus 
elements. Mutations at analogous splice site positions in other genes 
have been shown to alter RNA processing in vivo and in vitro . 

Three exons of APG were also evaluated in sporadic tumors. 
Sixty tumors were screened by RNase protection, and an additional 98 
tumors were evaluated by sequencing. The exons examined included nt 
822-930, 931-1309, and 1406-1545 (Table I). A total of three mutations 
were identified, each of which proved to be somatic. Tumor T27 con- 
tained a somatic mutation of GGA (arginine) to TGA (stop codon) at 
codon 33. Tumor T135 contained a GT to GG change at a splice donor 
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Site. Tumor T34 contained a 5 bp insertion (CAGCC between codons 
288 and 289) resulting In a stop at codon 291 due to a frameshif t. 

We serendipitously discovered one additional somatic mutation in 
a colorectal cancer. During our attempt to define the sequences and 
splice patterns of the MCC and APC gene products in colorectal 
epithelial ceils, we cloned cDNA from the colorectal cancer cell line 
SW480. The amino acid sequence of the MCC gene from SW480 was 
Identical to that previously found In clones from human brain. The 
sequence of APC in SW480 cells, however, differed significantly, in 
that a transition at codon 1338 resulted In a charge from glutamine 
(CAG) to a stop codon (TAG). To determine If this mutation was 
somatic, we recovered DNA from archival paraffin blocks of the origi- 
nal surgical specimen (T201) from which the tumor cell line was 
derived 28 years ago. 

DNA was purified from paraffin sections as described in S.E. 
Goelz, S.R. Hamilton, and B. Vogelsteln. Blochem. Blophys. Res. 
Comm. 130, 118 (1985). PGR was performed as described in reference 
24, using the primers 5'-GTTCCAGCAGTGTCACAG-3' and 
5'-GGGAGATTTCGCTCCTGA-3\ A PGR product containing codon 
1338 was amplified from the archival DNA and used to show that the 
stop codon represented a somatic mutation present in the original pri- 
mary tumor and in cell lines derived from the primary and metastatic 
tumor sites, but not from normal tissue of the patient. 

The ten point mutations In the MCC and APC genes so far dis- 
covered in sporadic CRCs are summarized in Table IIB. Analysis of the 
number of mutant and wild-type PGR clones obtained from each of 
these tumors showed that in eight of the ten cases, the wild-type 
sequence was present in approximately equal proportions to the 
mutant. This was confirmed by RFLP analysis using flanking markers 
from chromosome 5q which demonstrated that only two of the ten 
tumors (T135 and T201) exhibited an allelic deletion on chromosome 5q. 
These results are consistent with previous observations showing that 
20-40% of sporadic colorectal tumors had allelic deletions of chromo- 
some 5q. Moreover, these data suggest that mutations of 5q2i genes 
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are not limited to those colorectal tumors which contain allelic dele- 
tions of this chromosome. 
Example 4 

This example characterizes small, nested deletions in DNA from 
two unrelated FAP patients. 

DNA from 40 FAP patients was screened with cosmids that had 
been mapped into a region near the APC locus to identify small dele- 
tions or rearrangements. Two of these cosmids, L5.71 and L5.79, 
hybridized with a 1200 kb NotI fragment in DMAs from most of the FAP 
patients screened. 

The DNA of one FAP patient, 3214, showed only a 940 kb NotI 
fragment instead of the expected 1200 kb fragment. DNA was ana- 
lyzed from four other members of the patient's immediate family; the 
940 kb fragment was present in her affected mother (4711), but not in 
the other, unaffected family members. The mother also carried a nor- 
mal 1200 kb NotI fragment that was transmitted to her two unaffected 
offspring. These otsservations indicated that the mutant polyposis 
allele is on the same chromosome as the 940 kb NotI f ragment, A sim- 
ple interpretation is that APC patients 3214 and 4711 each carry a 260 
kb deletion within the APC locus. 

If a deletion were present, then other enzymes might also be 
expected to produce fragments with altered mobilities. Hybridization 
of LS.79 to Nrul-digested DNAs from both affected members of the 
family revealed a novel Nrul fragment of 1300 kb, in addition to the 
normal 1200 kb Nrul fragment. Furthermore, Mlul fragments in 
patients 3214 and 4711 also showed an increase In size consistent with 
the deletion of an Mlul site. The two chromosome 5 homologs of 
patient 3214 were segregated in somatic cell hybrid lines; HHW1155 
(deletion hybrid) carried the abnormal homolog and HHW1159 (normal 
hybrid) carried the normal homolog. 

Because patient 3214 showed only a 940 kb NotI fragment, she 
had not inherited the 1200 kb fragment present in the unaffected 
father's DNA. This observation suggests that he must be heterozygous 
for, and have transmitted, either a deletion of the L5.79 prot)e region 
or a variant NotI fragment too large to resolve on the gel system. As 
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expected, the hybrid cell line HHW1159, which carries the paternal 
homology revealed no resolved Not fragment when probed with L5.79, 
However, probing of HHW1159 DNA with L5.79 following digestion with 
other enzymes did reveal restriction fragments, demonstrating the 
presence of DNA homologous to the probe. The father is, therefore, 
interpreted as heterozygous for a polymorphism at the Not! site, with 
one chromosome 5 having a 1200 kb Not! fragment and the other hav- 
ing a fragment too large to resolve consistently on the geL The latter 
was transmitted to patient 3214. 

When double digests were used to order restriction sites within 
the 1200 kb NotI fragment, L5.71 and L5.79 were both found to lie on a 
550 kb Notl-Nrul fragment and, therefore, on the same side of an Nrul 
site in the 1200 kb NotI fragment. To obtain genomic representation of 
sequences present over the entire 1200 kb NotI fragment, we con- 
structed a library of small-fragment inserts enriched for sequences 
from this fragment. DNA from the somatic cell hybrid HHW141, which 
contains about 40% of chromosome 5, was digested with NotI and 
elect rophoresed under pulsed-field gel (PFG) conditions; EcoRI frag- 
ments from the 1200 kb region of this gel were cloned into a phage 
vector. Probe Map30 was Isolated from this library. In normal individ- 
uals probe Map30 hybridizes to the 1200 kb NotI fragment and to a 200 
kb Nrxil fragment. This latter hybridization places Map30 distal, with 
respect to the locations of L5.71 and L5,79, to the Nrul site of the 550 
kb Notl-Nnil fragment. 

Because Map30 hybridized to the abnormal, 1300 kb Nrul frag- 
ment of patient 3214, the locus defined by Map30 lies outside the 
hypothesized deletion. Furthermore, in normal chromosomes Map30 
identified a 200 kb Nrul fragment and L5.79 identified a 1200 kb Nrul 
fragment; the hypothesized deletion must, therefore, be removing an 
Nrul site, or sites, lying between Map30 and L5.79, and these two 
probes must flank the hypothesized deletion. A restriction map of the 
genomic region, showing placement of these prol)es, is shown in 
Figure 5. 

A NotI digest of DNA from another FAP patient, 3824, was 
probed with L5.79. In addition to the 1200 kb normal NotI fragment, a 
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fragment of approximately 1100 kb was observed, consistent with the 
presence of a 100 kb deletion in one chromosome 5. In this case, how- 
ever, digestion with Nrul and MIuI did not reveal abnormal bands, indi- 
cating that if a deletion were present, its boundaries must lie distal to 
the Nrul and Mlul sites of the fragments identified by L5.79, Consis- 
tent with this expectation, hybridization of Map30 to DNA from 
patient 3824 identified a 760 kb Mlul fragment in addition to the 
expected 860 kb fragment, supporting the interpretation of a 100 kb 
deletion in this patient. The two chromosome 5 homologs of patient 
3824 were segregated in somatic cell hybrid lines; HHW1291 was found 
to carry only the abnormal homolog and HHW1290 only the normal 
homolog. 

That the 860 kb Mlul fragment identified by Map30 is distinct 
from the 830 kb Mlul fragment identified previously by L5.79 was dem- 
onstrated by hybridization of Map30 and L5.79 to a Notl-Mlul double 
digest of DNA from the hybrid cell (HHW1159) containing the 
nondeleted chromosome 5 homolog of patient 3214. As previously indi- 
cated, this hybrid is interpreted as missing one of the NotI sites that 
define the 1200 kb fragment. A 620 kb Notl-Mlul fragment was seen 
with probe L5.79, and an 860 kb fragment was seen with Map30. 
Therefore, the 830 kb Mlul fragment recognized by prot)e L5.79 must 
contain a NotI site in HHW1159 DNA; because the 860 kb Mlul fragment 
remains intact. It does not carry this NotI site and must be distinct 
from the 830 kb Mlul fragment. 
Example 5 

This example demonstrates the isolation of human sequences 
which span the region deleted in the two unrelated FAP patients char- 
acterized in Example 4. 

A strong prediction of the hypothesis that patients 3214 and 
3824 carry deletions is that some sequences present on normal chromo- 
some 5 homologs would be missing from the hypothesized deletion 
homologs. Therefore, to develop genomic probes that might confirm 
the deletions, as well as to identify genes from the region, YAC clones 
from a con tig seeded by cosmid L5.79 were localized from a library 
containing seven haploid human genome equivalents (Albertsen et al-, 
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Proc. Natl. Acad. Sci, U.S.A., Vol. 87, pp. 4256-4260 (1990)) with 
respect to the hypothesized deletions. Three clones, YACs 57B8, 
310D8, and 183H12, were found to overlap the deleted region. 

Importantly, one end of YAC 57B8 (clone AT57) was found to lie 
within the patient 3214 deletion. Inverse polymerase chain reaction 
(PGR) defined the end sequences of the insert of YAC 57B8. PGR 
primers based on one of these end sequences repeatedly failed to 
amplify DNA from the somatic cell hybrid (HHW1155) carrying the 
deleted homolog of patient 3214, but did amplify a product of the 
expected size from the somatic cell hybrid {HHW1159) carrying the 
normal chromosome 5 homolog- This result supported the interpreta- 
tion that the abnormal restriction fragments found in the DNA of 
patient 3214 result from a deletion. 

Additional support for the hypothesis of deletion in DNA from 
patient 3214 came from subcloned fragments of YAG 183H12, which 
spans the region in question, YU, an EcoRI fragment cloned from 
YAG 183H12, hybridized to the normal, 1200 kb NotI fragment of 
patient 4711, but failed to hybridize to the abnormal, 940 kb NotI frag- 
ment of 4711 or to DNA from deletion cell line HHW1155, This result 
confirmed the deletion in patient 3214. 

Two additional EcoRl fragments from YAG 183H12, YIO and 
Y14, were localized within the patient 3214 deletion by their failure to 
hybridizie to DNA from HHW1155. Prot^e YIO hybridizes to a 150 kb 
Nrul fragment in normal chromosome 5 homologs. Because the 3214 
deletion creates the 1300 kb Nrul fragment seen with the probes L5.79 
and Map30 that flank the deletion, these Nrul sites and the 150 kb Nrul 
fragment lying between must be deleted in patient 3214. Furthermore, 
probe YIO hybridizes to the same 620 kb Notl-Mlul fragment seen with 
probe L5.79 in normal DNA, indicating its location as L5.79-proximal to 
the deleted Mlul site and placing it between the Mlul site and the 
L5.79-proximal Nrul site. The Mlul site must, therefore, lie between 
the Nrul sites that define the 150 kb Nrul fragment (see Figure 5). 

Probe Yll also hybridized to the 150 kb Nrul fragment in the 
normal chromosome 5 homolog, but failed to hybridize to the 620 kb 
Notl-Mlul fragment, placing it L5.79-distal to the Mlul site, but 



SUBSTITUTE SHEET 



wo 92/13103 



- 34 - 



PCT/US92/00376 



proximal to the second Nrul site. Hybridization to the same (860 kb) 
Mlul fragment as Map30 confirmed the localization of probe Yll 
L5.79-distal to the Mlul site. 

Probe yi4 was shown to be L5.79-distal to both deleted Nrul 
sites by virtue of its hybridization to the same 200 kb Nrul fragment of 
the normal chromosome 5 seen with MapSO. Therefore, the order of 
these EcoRI fragments derived from YAC 183H12 and deleted in 
patient 3214, with respect to L5.79 and Map30, is 
L5.79-Y10-Yll-Y14-Map30. 

The 100 kb deletion of patient 3824 was confirmed by the failure 
of aberrant restriction fragments in this DNA to hybridize with probe 
Yll, combined with positive hybridizations to probes YlO and/or Y14. 
YIO and Y14 each hybridized to the 1100 kb NotI fragment of patient 
3824 as well as to the normal 1200 kb NotI fragment, but Yll hybrid- 
ized to the 1200 kb fragment only. In the Mlul digest, probe Y14 
hybridized to the 860 kb and 760 kb fragments of patient 3824 DNA, but 
probe Yll hybridized only to the 860 kb fragment. We conclude that 
the basis for the alteration in fragment size in DNA from patient 3824 
is. Indeed, a deletion. Furthermore, because probes YIO and Y14 are 
missing from the deleted 3214 chromosome, but present on the deleted 
3824 chromosome, and they have t>een shown to flank probe Yll, the 
deletion in patient 3824 must be nested within the patient 3214 
deletion. 

Probes YIO, Yll, Y14 and Map30 each hybridized to YAC 310D8, 
indicating that this YAC spanned the patient 3824 deletion and at a 
minimum, most of the 3214 deletion. The YAC characterizations, 
therefore, confirmed the presence of deletions in the patients and pro- 
vided physical representation of the deleted region. 
Example 6 

This example demonstrates that the MCC coding sequence maps 
outside of the region deleted in the two FAP patients characterized in 
Example 4« 

An intriguing FAP candidate gene, MCC, recently was ascer- 
tained with cosmid L5.71 and was shown to have undergone mutation in 
colon carcinomas (Kinzler et al., supra ). It was therefore of interest to 
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map this gene with respect to the deletions in APC patients. Hybrid- 
ization of MCC probes with an overlapping series of YAC clones 
extending in either direction from L5.71 showed that the 3' end of MCC 
must be oriented toward the region of the two APC deletions. 

Therefore, two 3* cDNA clones from MCC were mapped with 
respect to the deletions: clone iCI (bp 2378-4181) and clone 7 (bp 
2890-3560). Clone ICI contains sequences from the C-terminal end of 
the open reading frame, which stops at nucleotide 2708, as well as 3* 
untranslated sequence. Clone 7 contains sequence that is entirely 3' to 
the open reading frame. Importantly, the entire 3' untranslated 
sequence contained in the cDNA clones consists of a single 2,5 kb exon. 
These two clones were hybridized to DNAs from the YACs spanning the 
FAP region. Clone 7 fails to hybridize to YAC 310D8, although it does 
hybridize to YACs 183H12 and 57B8; the same result was obtained with 
the cDN A ICI. Furthermore, these probes did show hybridization to 
DNAs from both hybrid cell lines (HWW1159 and HWW1155) and the 
lymphoblastoid cell line from patient 3214, confirming their locations 
outside the deleted region. Additional mapping experiments suggested 
that the 3* end of the MCC cDNA clone contig is likely to be located 
more than 45 kb from the deletion of patient 3214 and, therefore, more 
than 100 kb from the deletion of patient 3824. 
Example 7 

This example identifies three genes within the deleted region of 
chromosome 5 in the two unrelated FAP patients characterized in 
Example 4. 

Genomic clones were used to screen cDNA libraries in three 
separate experiments. One screening was done with a phage clone 
derived from YAC 310D8 known to span the 260 kb deletion of patient 
3214. A large-insert phage library was constructed from this YAC; 
screening with Yll identified X205, which mapped within both dele- 
tions. When clone X205 was used to prol)e a random-, plus oligo(dT)-, 
primed fetal brain cDNA library (approximately 300,000 phage), six 
cDNA clones were isolated and each of them mapped entirely within 
both deletions. Sequence analysis of these six clones formed a single 
cDNA contig, but did not reveal an extended open reading frame. One 
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of the six cDNAs was used to isolate more cDNA clones, some of which 
crossed the LS.Tl-proxlmal breakpoint of the 3824 deletion, as indi- 
cated by hybridization to both chromosome of this patient. These 
clones also contained an open reading frame, indicating a transcrip- 
tional orientation proximal to distal with respect to L5,71. This gene 
was named DPI (deleted in polyposis 1). This gene is identical to TB2 
described above. 

cDNA walks yielded a cDNA contig of 3.0-3.5 kb, and included 
two clones containing terminal poly(A) sequences. This size corre- 
sponds to the 3,5 kb band seen by Northern analysis. Sequencing of the 
first 3163 bp of the cDNA contig revealed an open reading frame 
extending from the first base to nucleotide 631, followed by a 2.5 kb 3' 
untranslated region. The sequence surrounding the methionine codon 
at base 77 conforms to the Kozak consensus of an initiation methionine 
(Kozak, 1984). Failed attempts to walk farther, coupled with the simi- 
larity of the lengths of isolated cDNA and mRNA, suggested that the 
NH2-terminus of the DPI protein had been reached. Hybridization to a 
combination of genomic and YAC DNAs cut with various enzymes indi- 
cated the genomic coverage of DPI to be approximately 30 kb. 

Two additional probes for the locus, YS-11 and YS-39, which had 
been ascertained by screening of a cDNA Ubrary with an independent 
YAC probe identified with MCC sequences adjacent to L5.71, were 
mapped into the deletion region. YS-39 was shown to be a cDNA iden- 
tical in sequence to DPI. Partial characterization of YS-11 had shown 
that 200 bp of DNA sequence at one end was identical to sequence cod- 
ing for the 19 kd protein of the ribosomal signal recognition particle, 
SRP19 (Lingelbach et al., supra ). Hybridization experiments mapped 
YS-11 within both deletions. The sequence of this clone, however, was 
found to be complex. Although 454 bp of the 1032 bp sequence of 
YS-11 were identical to the GenBank entry for the SRP19 gene, 
another 578 bp appended 5' to the SRP19 sequence was found to consist 
of previously unreported sequence containing no extended open reading 
frames. This suggested that YS-ll was either a chimeric clone con- 
taining two independent inserts or a clone of an incompletely processed 
or aberrant message. If YS-ll were a conventional chimeric clone, the 
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independent segments would not be expected to map to the same physi- 
cal region. The segments resulting from anomalous processing of a 
continuous transcript, however, would map to a single chromosomal 
region. 

Inverse PGR with primers specific to the two ends of YS-ll, the 
SRP19 end and the unidentified region, verified that both sequences 
map within the YAC 3I0D8; therefore, YS-II is most likely a clone of 
an immature or anomalous mRNA species. Subsequently, both ends 
were shown to lie with the deleted region of patient 3824, and YS-ll 
was used to screen for additional cDNA clones. 

Of the 14 cDNA clones selected from the fetal brain library, one 
clone, V5, was of particular interest in that it contained an open read- 
ing frame throughout, although it included only a short identity to the 
first 78 5* bases of the YS-ll sequence. Following the 78 bp of identi- 
cal sequence, the two cDNA sequences diverged at an AG. Further- 
more, divergence from genomic sequence was also seen after these 78 
bp» suggesting the presence of a splice junction, and supporting the 
view that YS-ll represents an irregular message. 

Starting with V5, successive 5' and 3* walks were performed; the 
resulting cDNA contig consisted of more than 100 clones, which 
defined a new transcript, DP2. Clones walking in the 5' direction 
crossed the 3824 deletion breakpoint farthest from L5.71; since its 3* 
end is closer to this cosmid than its 5' end, the transcriptional orienta- 
tion of DP2 is opposite to that of MCC and DPI. 

The third screening approach relied on hybridization with a 120 
kb Mlul fragment from YAC 57B8. This fragment hybridizes with probe 
YIl and completely spans the 100 kb deletion in patient 3824. the 
fragment was purified on two preparative PFGs, labeled, and used to 
screen a fetal brain cDNA library. A number of cDNA clones previ- 
ously identified in the development of the DPI and DP2 contigs were 
reascertained. However, 19 new cDNA clones mapped into the patient 
3824 deletion. Analysis indicated that these 19 formed a new contig, 
DP3, containing a large open reading frame, 

A clone from the 5* end of this new cDNA contig hybridized to 
the same EcoRI fragment as the 3' end of DP2. Subsequently, the DP2 
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and DPS contigs were connected by a single 5* walking step from DP3, 
to form the single contig DP2.5. The complete nucleotide sequence of 
DP2.5 is shown in Figure 9. 

The consensus cDNA sequence of DP2.5 suggests that the entire 
coding sequence of DP2,5 has been obtained and is 8532 bp long. The 
most 5' ATG codon occurs two codons from an in-frame stop and con- 
forms to the Kozak initiation consensus (Kozak, Nucl. Acids. Res., 
Vol. 12, p. 857-872 1984). The 3* open reading frame breaks down over 
the final 1.8 kb, giving multiple stops in all frames. A poly{A) sequence 
was found in one clone approximately 1 kb into the 3* untranslated 
region, associated with a polyadenylation signal 33 bp upstream {posi- 
tion 9530). The open reading frame is almost identical to that identi- 
fied as A PC above. 

An alternatively spliced exon at nucleotide 934 of the DP2.5 
transcript is of potential interest, it was first discovered by noting 
that two classes of cDNA had been isolated. The more abundant cDNA 
class contains a 303 bp exon not included in the other. The presence in 
vivo of the two transcripts was verified by an exon connection experi- 
ment. Primers flanking the alternatively spliced exon were used to 
amplify, by PCR, cDNA prepared from various adult tissues. Two PGR 
products that differed in size by approximately 300 bases were ampli- 
fied from sdl the tissues tested; the larger product was always more 
abundant than the smaUer. 
Example 8 

This example demonstrates the primers used to identify subtle 
mutations in DPI, SRP19, and DP25. 

To obtain DNA sequence adjacent to the exons of the genes DPI, 
DP2,5, and SRP19. sequencing substrate was obtained by inverse PCR 
amplification of DNAs from two YACs, 310D8 and 183H12, that span 
the deletions. Ligation at low concentration cyclized the restriction 
enzyme-digested YAC DNAs. Oligonucleotides with sequencing tails, 
designed in inverse orientation at intervals along the cDNAs, primed 
PCR amplification from the cyclized templates. Comparison of these 
DNA sequences with the cDNA sequences placed exon boundaries at 
the divergence points. SRP19 and DPI were each shown to have five 



eimQTiTtfTr currr 



wo 92/13103 



- 39 - 



PCT/US92/00376 



exons- DP2.5 consisted of 15 exons. The sequences of the 
oligonucleotides synthesized to provide PGR amplification primers for 
the exons of each of these genes are listed in Table m. With the excep- 
tion of exons 1, 3, 4, 9, and 15 of DP2.5 (see below), the primer 
sequences were located in intron sequences flanking the exons. The 5* 
primer of exon 1 Is complementary to the cDNA sequence, but extends 
just into the 5' Kozak consensus sequence for the initiator methionine, 
allowing a survey of the translated sequences. The 5* primer of exon 3 
is actually In the 5' coding sequences of this exon, as three separate 
intronic primers simply would not amplify. The 5' primer of exon 4 just 
overlaps the 5' end of this exon, and we thus fail to survey the 19 most 
5» bases of this exon. For exon 9, two overlapping primer sets were 
used, such that each had one end within the exon. For exon 15, the 
large 3* exon of DP2.5, overlapping primer pairs were placed along the 
length of the exon; each pair amplified a product of 250-400 bases. 
Example 9 

This example demonstrates the use of single stranded conforma- 
tion polymorphism (SSCP) analysis as described by Orlta et al. Proc. 
Natl. Acad. Sci. U.S.A., Vol. 86. pp. 2766-70 (1989) and Genomics, 
Vol. 5, pp. 874-879 (1989) as applied to DPI, SRP19 and DP2.5. 

SSCP analysis identifies most single- or multiple-base changes in 
DNA fragments up to 400 bases in length. Sequence alterations are 
detected as shifts In electrophoretic mobility of single-stranded DNA 
on nondenaturing acrylamide gels; the two complementary strands of a 
DNA segment usually resolve as two SSCP conformers of distinct 
mobilities. However, if the sample is from an individual heterozygous 
for a base-pair variant within the amplified segment, often three or 
more bands are seen. In some cases, even the sample from a 
homozygous individual will show multiple bands. Base-pair-change 
variants are identified by differences in pattern among the DNAs of 
the sample set. 

Exons of the candidate genes were amplified by PCR from the 
DNAs of 61 unrelated FAP patients and a control set of 12 normal indi- 
viduals. The five exons from DPI revealed no unique conformers in the 
FAP patients, although common conformers were observed with exons 
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2 and 3 in some Individuals of both affected and control sets, indicating 
the presence of DN A sequence polymorphisms. Likewise, none of the 
five exons of SRP19 revealed unique conformers in DNA from FAP 
patients in the test panel. 

Testing of exons 1 through 14 and primer sets A through N of 
exon 15 of the DP2,5 gene, however, revealed variant conformers spe- 
cific to FAP patients in exons 7, 8, 10, 11, and 15. These variants were 
in the unrelated patients 3746, 3460, 3827, 3712, and 3751, respectively. 
The PC R-SSCP procedure was repeated for each of these exons in the 
five affected individuals and in an expanded set of 48 normal controls. 
The variant bands were reproducible in the FAP patients but were not 
observed in any of the control DNA samples. Additional variant con- 
formers in exons 11 and 15 of the DP2.5 gene were seen; however, each 
of these was found in both the affected and control DNA sets. The five 
sets of conformers unique to the FAP patients were sequenced to 
determine the nucleotide changes responsible for their altered mobili- 
ties. The normal conformers from the host individuals were sequenced 
also. Bands were cut from the dried acrylamide gels, and the DNA was 
eluted. PCR amplification of these DNAs provided template for 
sequencing. 

The sequences of the unique conformers from exons 7, 8, 10, and 
11 of DP2.5 revealed dramatic mutations in the DP2.5 gene. The 
sequence of the new mutation creating the exon 7 conformer in patient 
3746 was shown to contain a deletion of two adjacent nucleotides, at 
positions 730 and 731 in the cDNA sequence (Figure 7). The normal 
sequence at this spUce Junction is CAG GGTCA (intronic sequence 
underUned), with the intron-exon boundary between the two repetitions 
of AG. The mutant allele in this patient has the sequence CAGGTCA, 
Although this change is at the 5' splice site, comparison with known 
consensus sequences of splice Junctions would suggest that a functional 
splice Junction is maintained. If this new splice junction were func- 
tional, the mutation would introduce a frameshift that creates a stop 
codon 15 nucleotides downstream. If the new splice junction were not 
functional, messenger processing would be significantly altered. 
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To confirm the 2-base deletion, the PGR product from FAP 
patient 3746 and a control DNA were electrophoresed on an 
acrylamide-urea denaturing gel, along with the products of a sequenc- 
ing reaction. The sample from patient 3746 showed two bands differing 
in size by 2 nucleotides, with the larger band identical in mobility to 
the control sample; this result was independent confirmation that 
patient 3746 is heterozygous for a 2 bp deletion. 

The unique conformer found in exon 8 of patient 3460 was found 
to carry a C-T transition, at position 904 in the cDNA sequence of 
DP2.5 (shown in Figure 7), which replaced the normal sequence of CGA 
with TGA, This point mutation, when read in frame, results in a stop 
codon replacing the normal arginine codon. This single-base change 
had occurred within the context of a CG dimer, a potential hot spot for 
mutation (Barker et al., 1984). 

The conformer unique to FAP patient 3827 in exon 10 was found 
to contain a deletion of one nucleotide (1367, 1368, or 1369) when com- 
pared to the normal sequence found in the other bands on the SSCP gel. 
This deletion, occurring within a set of three T's, changed the sequence 
from CTTTCA to CTTCA; this 1 base f rameshif t creates a downstream 
stop within 30 bases. The PGR product amplified from this patient's 
DNA also was electrophoresed on an acrylamide-urea denaturing gel, 
along with the PGR product from a control DNA and products from a 
sequencing reaction. The patient's PGR product showed two bands 
differing by 1 bp in length, with the larger identical in mobility to the 
PGR product from the normal DNA; this result confirmed the presence 
of a 1 bp deletion in patient 3827. 

Sequence analysis of the variant conformer of exon 11 from 
patient 3712 revealed the substitution of a T by a G at position 1500, 
changing the normal tyrosine codon to a stop codon. 

The pair of conformers observed in exon 15 of the DP2.5 gene 
for FAP patient 3751 also was sequenced. These conformers were 
found to carry a nucleotide substitution of G to G at position 5253, the 
third base of a valine codon. No amino acid change resulted from this 
substitution, suggesting that this conformer reflects a genetically sUent 
polymorphism. 
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The observation of distinct inactivating mutations in the DP2.5 
gene in four unrelated patients strongly suggested that DP2.5 is the 
gene involved in FAP. These mutations are summarized In Table IIA. 
Example 10 

This example demonstrates that the mutations identified in the 
DP2.5 (APC) gene segregate with the FAP phenotype. 

Patient 3746, described above as carrying an APC allele with a 
frameshif t mutation, is an affected offspring of two normal parents. 
Colonoscopy revealed no polyps in either parent nor among the 
patient's three siblings. 

DNA samples from both parents, from the patient's wife, and 
from their three children were examined. SSCP analysis of DNA from 
both of the patient's parents displayed the normal pattern of conform- 
ers for exon 7, as did DNA from the patients's wife and one of his off- 
spring. The two other children, however, displayed the same new con- 
formers as their affected father. Testing of the patient and his parents 
with highly polymorphic VNTR (variable number of tandem repeat) 
markers showed a 99.98% likelihood that they are his biological 
parents. 

These observations confirmed that this novel conformer, known 
to reflect a 2 bp deletion mutation in the DP2.5 gene, appeared sponta- 
neously with FAP in this pedigree and was transmitted to two of the 
children of the affected individual. 
Example 11 

This example demonstrates polymorphisms in the APC gene 
which appear to be unrelated to disease (FAP), 

Sequencing of variant conformers found among controls as well 
as individuals with APC has revealed the following polymorphisms in 
the APC gene: first, in exon 11, at position 1458, a substitution of T to 
C creating an Rsal restriction site but no amino acid change; and sec- 
ond, in exon 15, at positions 5037 and 5271, substitutions of A to G and 
G to T, respectively, neither resulting in amino acid substitutions. 
These nucleotide polymorphisms in the APC gene sequence may be 
useful for diagnostic purposes. 
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Example 12 

This example shows the structure of the A PC gene. 

The structure of the APC gene is schematically shown in 
Figure 8, with flanking intron sequences indicated. 

The continuity of the very large (6,5 kb), most 3' exon in DP2.5 
was shown In two ways. First, inverse PGR with primers spanning the 
entire length of this exon revealed no divergence of the cDNA 
sequence from the genomic sequence. Second, PGR amplification with 
converging primers placed at intervals along the exon generated prod- 
ucts of the same size whether amplified from the originally isolated 
cDNA, cDNA from various tissues, or genomic template. Two forms of 
exon 9 were found in DP2.5: one is the complete exon; and the other, 
labeled exon 9A, Is the result of a splice into the interior of the exon 
that deletes bases 934 to 1236 in the mRNA and removes 101 amino 
acids from the predicted protein (see Figure 7). 
Example 13 

This example demonstrates the mapping of the FAP deletions 
with respect to the APC exons. 

Somatic cell hybrids carrying the segregated chromosomes 5 
from the 100 kb (HHW1291) and 260 kb (HHW1155) deletion patients 
were used to determine the distribution of the APC genes exons across 
the deletions. DNAs from these cell lines were used as template, along 
with genomic DMA from a normal control, for PCR-based amplification 
of the APC exons. 

PGR analysis of the hybrids from the 260 kb deletion of patient 
3214 showed that all but one (exon l) of the APC exons are removed by 
this deletion, PGR analysis of the somatic cell hybrid HHW1291, carry- 
ing the chromosome 5 homolog with the 100 kb deletion from patient 
3824, revealed that exons 1 through 9 are present but exons 10 through 
15 are missing. This result placed the deletion breakpoint either 
Ijetween exons 9 and 10 or within exon 10. 
Example 14 

This example demonstrates the expression of alternately spliced 
APC messenger in normal tissues and in cancer cell lines. 
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Tissues that express the APC gene were identified by PCR 
amplification of cDNA made to mRNA with primers located within 
adjacent APC exons. In addition, PCR primers that flank the alterna- 
tively spliced exon 9 were chosen so that the expression pattern of 
both splice forms could t>e assessed* All tissue types tested (brain, lung, 
aorta, spleen, heart, kidney, liver, stomach, placenta, and colonic 
mucosa) and cultured cell lines (lymphoblasts, HL60, and 
choriocarcinoma) expressed both splice forms of the APC gene. We 
note, however, that expression by lymphocytes normally residing in 
some tissues, including colon, prevents unequivocal assessment of 
expression. The large mRNA, containing the complete exon 9 rather 
than only exon 9A, appears to t>e the more abundant message. 

Northern analysis of poly(A)-selected RNA from lymphoblasts 
revealed a single band of approximately 10 kb, consistent with the size 
of the sequenced cDNA. 
Example 15 

This example discusses structural features of the APC protein 
predicted from the sequence. 

The cDNA consensus sequence of APC predicts that the longer, 
more abundant form of the message codes for a 2842 or 28444 amino 
acid peptide with a mass of 311.8 kd. This predicted APC peptide was 
compared with the current data bases of protein and DNA sequences 
using both Intelligenetics and GCG software packages. No genes with a 
high degree of amino acid sequence similarity were found. Although 
many short (approximately 20 amino acid) regions of sequence similar- 
ity were uncovered, none was sufficently strong to reveal which, if 
any, might represent functional homology. Interestingly, multiple simi- 
larities to myosins and keratins did appear. The APC gene also was 
scanned for sequence motifs of known function; although multiple 
glycosylation, phosphorylation, and myristoylation sites were seen, 
their significance is uncertain. 

Analysis of the APC peptide sequence did identify features 
important in considering potential protein structure. Hydropathy plots 
(Kyte and Doolittle, J. Mol. Biol. Vol. 157, pp. 105-132 (1982)) indicate 
that the APC protein is notably hydrophilic. No hydrophobic domains 
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suggesting a signal peptide or a membrane-spanning domain were 
found. Analysis of the first 1000 residues indicates that a-helicai rods 
may form (Cohen and Parry, Trends Biochem. Sci. Vol, 77, pp. 245-248 
(1986); there is a scarcity of proline residues and, there are a number of 
regions containing heptad repeats (apolar-X-X-apolar-X-X-X). Inter- 
estingly, in exon 9A, the deleted form of exon 9, two heptad repeat 
regions are reconnected in the proper heptad repeat frame, deleting 
the intervening peptide region. After the first 1000 residues, the high 
proline content of the remainder of the peptide suggests a compact 
rather than a rod-like structure- 

The most prominent feature of the second lOOO residues is a 20 
amino acid repeat that Is iterated seven times with semiregular spacing 
(Table 4). The intervening sequences between the seven repeat regions 
contained 114, 116, 151, 205, 107, and 58 amino acids, respectively. 
Finally, residues 2200-24000 contain a 200 amino acid basic domain. 
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SEQUENCE LISTING 



<1) GENERAL INFORMATION: 

(i) APPLICANT: ALBERTiSEN/ HANS 
ANAND, RAKESH 
CARLSON, MARY 
GRODEN, JOANNA 
HEDGE, PHILIP J« 
JOSLYN, GEOFF 
KINZLER, KENNETH 
MARKHAM, ALEXANDER F. 
NAKAMURA, YUSUKE 
THLIVERIS, ANDREW 

(ii) TITLE OF INVENTION: INHERITED AND SOMATIC MUTATIONS OF APC 
GENE IN COLORECTAL CANCER IN HUMANS 

(ill) NUMBER OF SEQUENCES: 94 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Banner, Birch, McKie & Beckett 

(B) STREET: 1001 G Street, NW 

(C) CITY: Weehington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20001-4598 

<v} COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy diek 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

<D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/741,940 

(B) FILING DATE: 08-AUG-1991 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Kagan, Sarah A. 

(B) REGISTRATION NUMBER: 32,141 

(C) REFERENCE /DOCKET NUMBER: 1107.035574 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-508-9100 

(B) TELEFAX: 202-508-9299 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE0NES5: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



SUBSTITUTE SHEET 



wo 92/13103 



47- 



PCT/US92/00376 



fvii) IMMEDIATE SOURCE: 

(B) CLONE: OP2.5(APC} 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 34,.8S62 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGACTCGGAA ATGAGCTCCA AGGCTAGCCA ACC ATG OCT CCA OCT TCA TAT GAT 54 

Met Ala Ala Ala Ser Tyr Asp 
1 5 

CAC TTG TTA AAC CAA CTT GAG CCA CTC AAG ATG GAG AAC TCA AAT CTT 102 
Gin Leu Leu Ly» Gin Val Glu Ala Leu Lye Met Clu Aen Ser A»n Leu 
10 15 20 

CGA CAA GAG CTA GAA GAT AAT TCC AAT CAT CTT ACA AAA CTC GAA ACT 150 
Ara Gin Glu Leu Glu Aep Aen Ser Asn Hla Leu Thr Lys Leu Glu Thr 
25 30 35 

GAG GCA TCT AAT ATG AAG GAA GTA CTT AAA CAA CTA CAA GGA ACT ATT 198 
Glu Ala Ser Aan Met Lye Glu Val Leu Lya Cln Leu Gin Gly Ser lie 
40 45 SO 55 

GAA GAT GAA GCT ATG GCT TCT TCT GGA CAG ATT GAT TTA TTA GAG CGT 24 6 

Glu Asp Glu Ala Met Ala Ser Ser Gly Gin lie Aap Leu Leu Glu Arg 

60 65 70 

CTT AAA GAG CTT AAC TTA GAT AGC ACT AAT TTC CCT GGA GTA AAA CTG 294 
Leu Lye Glu Leu Aan Leu Aap Ser Ser Aan Phe Pro Gly Val Lya Leu 
75 80 85 

CGG TCA AAA ATG TCC CTC CGT TCT TAT CGA AGC CGG GAA GGA TCT GTA 342 
Arg Ser Lya Met Ser Leu Arg Ser Tyr Gly Ser Arg Glu Gly Ser Val 
90 95 100 

TCA AGC CGT TCT GGA GAG TGC ACT CCT GTT CCT ATG GGT TCA TTT CCA 390 
Ser Ser Arg Ser Gly Glu Cya Ser Pro Val Pro Met Gly Ser Phe Pro 
105 110 115 

AGA AG A GGG TTT GTA AAT GCA AGC AG A CAA ACT ACT CCA TAT TTA GAA 438 
Arg Arg Gly Phe Val Aan Gly Ser Arg Glu Ser Thr Gly Tyr Leu Glu 
120 125 130 135 

CAA CTT GAG AAA GAG AGG TCA TTC CTT CTT GCT GAT CTT CAC AAA GAA 486 
Glu Leu Glu Lya Glu Arg Ser Leu Leu Leu Ala Aap Leu Aap Lya Glu 
140 145 150 

CAA AAG GAA AAA CAC TCC TAT TAC GCT CAA CTT CAG AAT CTC ACT AAA 534 
Clu Lya Glu Lya Aap Trp Tyr Tyr Ala Gin Leu Gin Aan Leu Thr Lya 
155 160 165 

AGA ATA GAT ACT CTT CCT TTA ACT GAA AAT TTT TCC TTA CAA ACA GAT 582 
Arg He Aap Ser Leu Pro Leu Thr Glu Aan Phe Ser Leu Gin Thr Aap 
170 175 180 

TTG ACC AGA AGG CAA TTG GAA TAT GAA GCA AGC CAA ATC AGA GTT GCG 630 
Leu Thr Arg Arg Gin Leu Glu Tyr Glu Ala Arg Gin He Arg Val Ala 
185 190 195 
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ATC CAA GAA CAA CTA GGT ACC TGC CAG GAT ATC GAA AAA CCA CCA CAG 678 

Met Clu Clu Gin Leu Gly Thr Cys Gin Asp Met Glu Lye Arg AIa Gin 
200 205 210 215 

CGA AG A ATA GCC AG A ATT CAG CAA ATC GAA AAG GAC ATA CTT CGT ATA 726 
Arg Arg lie Ala Arg lie Gin Gin He Glu Lye Asp He Leu Arg lie 
220 225 230 

CGA CAG CTT TTA CAG TCC CAA GCA ACA GAA GCA GAG AGG TCA TCT CAG 7 74 

Arg Gin Leu Leu Gin Ser Gin Ala Thr Glu Ala Glu Arg Ser Ser Gin 
235 240 245 

AAC AAG CAT GAA ACC GCC TCA CAT GAT GCT GAG CGG CAG AAT GAA CGT 822 
Aan Lya Kia Glu Thr Gly Ser Hia Aap Ala Glu Arg Gin Aan Glu Gly 
250 255 260 

CAA GGA GTG GGA GAA ATC AAC ATG GCA ACT TCT GGT AAT GGT CAG GGT 870 
Gin Gly Val Gly Glu He Aan Met Ala Thr Ser Gly Aan Gly Gin Gly 
265 270 275 

TCA ACT ACA CGA ATC GAC CAT GAA ACA GCC ACT CTT TTG ACT TCT ACT 918 
Ser Thr Thr Arg Met Asp His Glu Thr Ala Ser Val Leu Ser Ser Ser 
280 285 290 295 

AGC ACA CAC TCT GCA CCT CCA AGG CTG ACA ACT CAT CTG GGA ACC AAG 966 
Ser Thr His Ser Ala Pro Arg Arg Leu Thr Ser Hia Leu Gly Thr Lya 
300 305 310 

GTG GAA ATG GTG TAT TCA TTG TTG TCA ATG CTT GGT ACT CAT GAT AAG 1014 
Val Glu Met: Val Tyr Ser Leu Leu Ser Met Leu Gly Thr Hia Aap Lya 
315 320 325 

GAT GAT ATG TCG CGA ACT TTG CTA GCT ATG TCT ACC TCC CAA GAC ACC 1062 
Aap Asp Met Ser Arg Thr Leu Leu Ala Met Ser Ser Ser Gin Aap Ser 
330 335 340 

TCT ATA TCC ATG CGA CAG TCT GGA TGT CTT CCT CTC CTC ATC CAG CTT 1110 
Cys lie Ser Met Arg Gin Ser Gly Cye Leu Pro Leu Leu He Gin Leu 
345 350 355 

TTA CAT GGC AAT CAC AAA GAC TCT CTA TTG TTG GGA AAT TCC CGG GGC 1158 
Leu Hia Gly Asn Asp Lya Aap Ser Val Leu Leu Gly Aan Ser Arg Gly 
360 365 370 375 

ACT AAA GAG CCT CGG GCC AGG GCC ACT GCA CCA CTC CAC AAC ATC ATT 1206 
Ser Lya Glu Ala Arg Ala Arg Ala Ser Ala Ala Leu Hia Aan He He 
380 385 390 

CAC TCA CAG CCT GAT GAC AAC AGA GGC AGG CGT GAA ATC CGA CTC CTT 1254 
Hia Ser Gin Pro Asp Asp Lya Arg Gly Arg Arg Glu He Arg Val Leu 
395 400 405 

CAT CTT TTG GAA CAG ATA CGC GCT TAC TCT CAA ACC TGT TCG CAG TCG 1302 
Hia Leu Leu Clu Gin He Arg Ala Tyr Cya Glu Thr Cya Trp Glu Trp 
410 415 420 

CAG GAA GCT CAT GAA CCA CGC ATG GAC CAG GAC AAA AAT CCA ATG CCA 1350 
Gin Glu Ala Hia Glu Pro Gly Met Aap Gin Aap Lya Asn Pro Met Pro 
425 430 435 
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GCT CCT GTT CAA CAT CAG ATC TCT CCT CCT CTC TCT GTT CTA ATG AAA 

Ala Pro V*l Glu His Cln He Cya Pro Al* V«l Cy0 Val Leu Met Lye 

440 ^45 450 455 



139a 



CTT TCA TTT GAT CAA CAC CAT AGA CAT CCA ATC AAT GAA CTA CGC CCA 1446 

Leu Ser Phe Aap Clu Glu Hie Arg Hia Ala Met Aan Clu Leu Cly Cly 
460 470 

CTA CAG GCC ATT CCA GAA TTA TTC CAA CTG CAC TCT CAA ATC TAT GGG 1494 

Leu Gin Ala He Ala Clu Leu Leu Cln Val Aap Cya Clu Met Tyr Cly 
475 480 485 

CTT ACT AAT CAC CAC TAG ACT ATT ACA CTA AGA CCA TAT CCT CCA ATG 1542 

Leu Thr Aen Aap Hia Tyr Ser He Thr Leu Arg Arg Tyr Ala Cly Met 
490 495 500 

GCT TTC ACA AAC TTG ACT TTT CCA CAT CTA GCC AAC AAC CCT ACC CTA 1590 

Ala Leu Thr Aan Leu Thr Phe Cly Aap Val Ala Aan Lya Ala Thr Leu 

505 510 515 

TGC TCT ATC AAA GCC TGC ATC ACA CCA CTT CTC GCC CAA CTA AAA TCT 1638 

Cys Ser Met Lya Gly Cya Met Arg Ala Leu Val Ala Cln Leu Lya Ser 
520 525 530 535 

GAA ACT CAA CAC TTA CAG CAC CTT ATT CCA ACT CTT TTC ACC AAT TTG 1686 

Glu Ser Clu Aap Leu Cln Gin Val He Ala Ser Val Leu Arg Aan Leu 
540 545 550 

TCT TGC CCA CCA GAT CTA AAT ACT AAA AAC ACC TTC CCA GAA GTT CCA 1734 

Ser Trp Arg Ala Aap Val Aan Ser Lya Lya Thr Leu Arg Clu Val Cly 
555 560 565 

ACT CTC AAA CCA TTC ATC GAA TCT CCT TTA CAA CTT AAA AAC GAA TCA 1782 

Ser Val Lya Ala Leu Met Clu Cya Ala Leu Clu Val Lya Lya Clu Ser 
570 575 580 

ACC CTC AAA ACC CTA TTC ACT GCC TTA TGC AAT TTG TCA CCA CAT TGC 1830 

Thr Leu Lya Ser Val Leu Ser Ala Leu Trp Aan Leu Ser Ala Hia . Cya 

585 590 595 

ACT GAG AAT AAA CCT CAT ATA TCT CCT CTA CAT CCT CCA CTT CCA TTT 1878 

Thr Glu Aan Lya Ala Aap He Cya Ala Val Aap Gly Ala Leu Ala Phe 
600 605 610 615 

TTC GTT CCC ACT CTT ACT TAC CGC ACC CAC ACA AAC ACT TTA GCC ATT 1926 

Leu Val Cly Thr Leu Thr Tyr Arg Ser Cln Thr Aan Thr Leu Ala He 
620 625 630 

ATT CAA ACT CCA CCT CCC ATA TTA CCG AAT CTC TCC ACC TTC ATA CCT 1974 

He Glu Ser Cly Cly Cly He Leu Arg Aan Val Ser Ser Leu He Ala 
635 640 645 

ACA AAT CAG CAC CAC ACC CAA ATC CTA AGA CAC AAC AAC TCT CTA CAA 2022 

Thr Aan Clu Aap Hia Arg Cln He Leu Arg Clu Aan Aan Cya Leu Cln 
650 655 660 

ACT TTA TTA CAA CAC TTA AAA TCT CAT ACT TTG ACA ATA CTC ACT AAT 2070 

Thr Leu Leu Gin His Leu Lya Ser His Ser Leu Thr He Val Ser Aan 

665 670 675 
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CCA TOT CCA ACT TTC TCG AAT CTC TCA CCA ACA AAT CCT AAA CAC CAG 2118 
Ala CvB Cly Thr Leu Trp Aen Leu Ser Ala Arg Aan Pro Lya A«p Gin 
680 685 690 695 

CAA CCA TTA TGC CAC ATG CCC CCA CTT AGC ATG CTC AAC AAC CTC ATT 2166 
Glu Ala Leu Trp Asp Met Cly Ala Val Ser Met Leu Lya Asn Leu He 
700 705 710 

CAT TCA AAC CAC AAA ATG ATT CCT ATG CCA ACT CCT CCA CCT TTA ACG 2214 
Hie Ser Lye Hie Lys Met He Ala Met Cly Ser Ala Ala Ala Leu Arg 
715 720 725 

AAT CTC ATG CCA AAT AGC CCT CCC AAC TAC AAC CAT GCC AAT ATT ATG 2262 
Aan Leu Met Ala Aen Arg Pro Ala Lye Tyr Lya Aap Ala Asn lie Met 
730 735 740 

TCT CCT CCC TCA AGC TTC CCA TCT CTT CAT CTT AGC AAA CAA AAA GCC 2310 
Ser Pro Cly Ser Ser Leu Pro Ser Leu His Val Arg Lys Gin Lys Ala 
745 750 755 

CTA CAA CCA CAA TTA CAT CCT CAC CAC TTA TCA CAA ACT TTT CAC AAT 2358 
Leu Clu Ala Glu Leu Asp Ala Gin His Leu Ser Glu Thr Phe Asp Asn 
760 765 770 775 

ATA CAC AAT TTA ACT CCC AAC CCA TCT CAT CCT ACT AAC CAG ACA CAC 2406 
He Asp Asn Leu Ser Pro Lys Ala Ser His Arg Ser Lys Gin Arg His 
780 785 790 

AAC CAA ACT CTC TAT CCT CAT TAT CTT TTT CAC ACC AAT CCA CAT CAT 2454 
Lys Gin Ser Leu Tyr Cly Asp Tyr Val Phe Asp Thr Asn Arg His Asp 
795 800 805 

GAT AAT AGC TCA CAC AAT TTT AAT ACT GCC AAC ATG ACT CTC CTT TCA 2502 
Asp Asn Arg Ser Asp Asn Phe Asn Thr Gly Asn Met Thr Val Leu Ser 
810 815 820 

CCA TAT TTC AAT ACT ACA CTC TTA CCC ACC TCC TCT TCA TCA ACA CCA 2550 
Pro Tyr Leu Asn Thr Thr Val Leu Pro Ser Ser Ser Ser Ser Arg Gly 
825 830 835 

AGC TTA GAT ACT TCT CCT TCT CAA AAA CAT ACA ACT TTC GAG ACA CAA 2598 
Ser Leu Asp Ser Ser Arg Ser Glu Lys Asp Arg Ser Leu Glu Arg Glu 
840 845 850 855 

CCC CCA ATT GGT CTA CCC AAC TAC CAT CCA CCA ACA GAA AAT CCA CGA 2646 
Arg Cly lie Cly Leu Cly Asn Tyr His Pro Ala Thr Clu Asn Pro Cly 
860 865 870 

ACT TCT TCA AAG CGA CCT TTC CAG ATC TCC ACC ACT CCA CCC CAC ATT 2694 
Thr Ser Ser Lys Arg Cly Leu Gin He Ser Thr Thr Ala Ala Gin He 
875 880 885 

GCC AAA CTC ATG GAA GAA CTC TCA CCC ATT CAT ACC TCT CAC CAA CAC 2742 
Ala Lys Val Met Glu Glu Val Ser Ala He His Thr Ser Gin Clu Asp 
890 895 900 

ACA ACT TCT CCC TCT ACC ACT GAA TTA CAT TGT CTC ACA CAT CAG ACA 2790 
Arg Ser Ser Cly Ser Thr Thr Clu Leu His Cys Val Thr Asp Clu Arg 
905 910 915 
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AAT CCA CTT ACA AGA ACC TCT CCT CCC CAT ACA CAT TCA AAC ACT TAC 
Asn Ala Leu Arg Arg Ser Ser Ala Ala His Thr Hia Ser Aan Thr Tyr 
920 925 930 935 



2838 



AAT TTC ACT AAG TCG CAA AAT TCA AAT ACC ACA TGT TCT ATC CCT TAT 
Aan Phe Thr Lye Ser Clu Aan Ser Aan Arg Thr Cya Ser Met Pro Tyr 
940 945 950 



2886 



GCC AAA TTA CAA TAC AAC AGA TCT TCA AAT CAT ACT TTA AAT ACT GTC 
Ala Lya Leu Glu Tyr Lya Arg Ser Ser Aan Aap Ser Leu Aan Ser Val 
955 960 965 



2934 



ACT ACT AAT CAT GGT TAT GGT AAA ACA CCT CAA ATC AAA CCC TCG ATT 
Ser Ser Aan Aap Cly Tyr Gly Lya Arg Gly Gin Met Lya Pro Ser He 
970 975 980 



2982 



CAA TCC TAT TCT GAA CAT CAT CAA ACT AAC TTT TGC ACT TAT GOT CAA 
Glu Ser Tyr Ser Glu Aap Aap Glu Ser Lya Phe Cya Ser Tyr Gly Gin 
985 990 995 



3030 



TAC CCA GCC CAC CTA GCC CAT AAA ATA CAT ACT CCA AAT CAT ATG CAT 
Tyr Pro Ala Aap Leu Ala Hie Lya lie Hia Ser Ala Aan Hia Met Aap 
1000 1005 1010 1015 



3078 



GAT AAT GAT GGA GAA CTA GAT ACA CCA ATA AAT TAT ACT CTT AAA TAT 
Aap Aan Aap Gly Glu Leu Aap Thr Pro Zle Aan Tyr Ser Leu Lya Tyr 
1020 1025 1030 



3126 



TCA GAT GAG CAG TTC AAC TCT GGA AGC CAA ACT CCT TCA CAG AAT GAA 
Ser Aap Glu Gin Leu Aan Ser Gly Arg Gin Ser Pro Ser Gin Aan Glu 
1035 1040 1045 



3174 



AGA TCG CCA AGA CCC AAA CAC ATA ATA GAA GAT CAA ATA AAA CAA ACT 
Arg Trp Ala Arg Pro Lya Hia He lie Glu Aap Clu He Lya Gin Ser 
1050 1055 1060 



3222 



GAG CAA AGA CAA TCA ACC AAT CAA ACT ACA ACT TAT CCT CTT TAT ACT 
Clu Gin Arg Gin Ser Arg Aan Gin Ser Thr Thr Tyr Pro Val Tyr Thr 
1065 1070 1075 



3270 



GAG AGC ACT CAT GAT AAA CAC CTC AAG TTC CAA CCA CAT TTT GGA CAG 
Glu Ser Thr Aap Aap Lya Hia Leu Lya Phe Gin Pro Hia Phe Gly Gin 
1080 1085 1090 1095 



3318 



CAC GAA TCT CTT TCT CCA TAC AGC TCA CCC CCA CCC AAT CCT TCA GAA 
Gin Glu Cya Val Ser Pro Tyr Arg Ser Arg Gly Ala Aan Gly Ser Glu 
1100 1105 1110 



3366 



ACA AAT CCA GTC GGT TCT AAT CAT GGA ATT AAT CAA AAT CTA AGC CAG 
Thr Aan Arg Val Gly Ser Aan Hia Gly tie Aan Gin Aan Val Ser Gin 
1115 1120 1125 



3414 



TCT TTC TGT CAA CAA GAT CAC TAT CAA CAT CAT AAC CCT ACC AAT TAT 
Ser Leu Cys Gin Clu Aap Aap Tyr Clu Aap Aap Lya Pro Thr Aan Tyr 
1130 1135 1140 



3462 



ACT GAA CCT TAC TCT CAA CAA CAA CAC CAT CAA CAA CAA GAG ACA CCA 
Ser Clu Arg Tyr Ser Clu Clu Glu Gin Hie Clu Clu Glu Clu Arg Pro 
1145 1150 1155 



3510 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/003'?6 



-52- 



ACA AAT TAT ACC ATA AAA TAT AAT CAA CAC AAA CCT CAT CTC CAT CAC 
Thr Aan Tyr Ser lie Ly« Tyr Aen Clu Clu Ly« Arg His V*l Amp Gin 
1160 1165 1170 1175 



3556 



CCT ATT GAT TAT ACT TTA AAA TAT GCC ACA CAT ATT CCT TCA TCA CAC 
Pro He Aap Tyr Ser Lou Lye Tyr Ala Thr A«p He Pro Ser Ser Cln 
1180 1185 1190 



3606 



AAA CAC TCA TTT TCA TTC TCA AAG ACT TCA TCT CCA CAA ACC ACT AAA 
Lys Cln Ser Phe Ser Phe Ser Lys Ser Ser Ser Cly Cln Ser Ser Lya 
1195 1200 1205 



3654 



ACC CAA CAT ATG TCT TCA ACC ACT GAG AAT ACC TCC ACA CCT TCA TCT 
Thr Clu His Met Ser Ser Ser Ser Clu Asn Thr Ser Thr Pro Ser Ser 
1210 1215 1220 



3702 



AAT GCC AAG ACC CAC AAT CAC CTC CAT CCA ACT TCT CCA CAC ACT ACA 
Asn Ala Lys Arg Cln Aan Cln Leu His Pro Ser Ser Ala cln Ser Arg 
1225 1230 1235 



3750 



ACT GGT CAC CCT CAA AAG CCT GCC ACT TCC AAA CTT TCT TCT ATT AAC 
Ser Cly Cln Pro Gin Lys Ala Ala Thr Cys Lys Val Ser Ser lie Asn 
1240 1245 1250 1255 



3798 



CAA CAA ACA ATA CAC ACT TAT TCT CTA CAA CAT ACT CCA ATA TCT TTT 
Cln Clu Thr He Cln Thr Tyr Cys Val Clu Asp Thr Pro He Cys Phe 
1260 1265 1270 

TCA ACA TCT ACT TCA TTA TCA TCT TTC TCA TCA CCT CAA CAT CAA ATA 
Ser Arg Cys Ser Ser Leu Ser Ser Leu Ser Ser Ala Clu Asp Clu He 
1275 1280 1285 



3846 



3894 



CCA TGT AAT CAC ACC ACA CAC CAA CCA CAT TCT CCT AAT ACC CTC CAA 
Gly Cys Asn Cln Thr Thr Gin Clu Ala Asp Ser Ala Asn Thr Leu Cln 
1290 1295 1300 



3942 



ATA CCA CAA ATA AAA CCA AAC ATT CCA ACT ACC TCA CCT CAA GAT CCT 
He Ala Clu He Lys Cly Lys He Cly Thr Arg Ser Ala Clu Asp Pro 
1305 1310 1315 



3990 



CTG AGC CAA CTT CCA GCA CTC TCA CAC CAC CCT AGA ACC AAA TCC ACC 
Val Ser Glu Val Pro Ala Val Ser Cln His Pro Arg Thr Lys Ser Ser 
1320 1325 1330 1335 



4038 



ACA CTC CAC GGT TCT ACT TTA TCT TCA CAA TCA GCC ACC CAC AAA CCT 4086 
Arg Leu Cln Gly Ser Ser Leu Ser Ser Clu Ser Ala Arg His Lys Ala 
1340 1345 1350 

CTT CAA TTT CCT TCA GCA GCG AAA TCT CCC TCC AAA ACT CCT CCT CAC 4134 
Val Glu Phe Pro Ser Gly Ala Lys Ser Pro Ser Lys Ser Cly Ala Gin 
1355 1360 1365 



ACA CCC AAA ACT CCA CCT CAA CAC TAT CTT CAC CAC ACC CCA CTC ATG 
Thr Pro Lya Ser Pro Pro Clu His Tyr Val Cln Clu Thr Pro Leu Met 
1370 1375 1380 



4182 



TTT ACC ACA TGT ACT TCT CTC ACT TCA CTT CAT ACT TTT CAC ACT CCT 
Phe Ser Arg Cys Thr Ser Val Ser Ser Leu Asp Ser Phe Clu Ser Arg 
1385 1390 1395 



4230 
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TCC ATT CCC ACC TCC CTT CAG ACT CAA CCA TCC ACT CCA ATC CTA ACT 42 78 

Ser He Ala Sar S«r Val Cln S«r Clu Pro Cy» Ser Cly Mat V*l Sar 
1400 1405 1410 1415 

CCC ATT ATA ACC CCC ACT CAT CTT CCA CAT ACC CCT CCA CAA ACC ATC 4326 
Civ He He Ser Pro Ser A«p Leu Pro Aep Ser Pro Cly Cln Thr Met 
^ 1420 1425 1430 

CCA CCA ACC ACA ACT AAA ACA CCT CCA CCA CCT CCT CAA ACA CCT CAA 4 374 

Pro Pro Ser Arg Ser Lye Thr Pro Pro Pro Pro Pro Cln Thr Ala Cln 
1435 1440 1445 

ACC AAC CCA CAA CTA CCT AAA AAT AAA CCA CCT ACT CCT CAA AAC ACA 4422 
Thr Lys Arg Clu Val Pro Lya Aen Lye Ala Pro Thr Ala Clu Ly« Arg 
1450 1455 1460 

GAC ACT CCA CCT AAC CAA CCT CCA CTA AAT CCT CCA CTT CAG ACC CTC 4470 
Clu Ser Cly Pro Lya Cln Ala Ala Val Aan Ala Ala Val Cln Ar^' Val 
1465 1470 1475 

CAG CTT CTT CCA CAT CCT CAT ACT TTA TTA CAT TTT CCC ACA CAA ACT 4518 
Gin Val Leu Pro Aap Ala Aap Thr Leu Leu Hia Phe Ala Thr Clu Ser 
1480 1485 1490 1495 

ACT CCA CAT CCA TTT TCT TCT TCA TCC ACC CTC ACT CCT CTC ACC CTC 4566 
Thr Pro Aap Gly Phe Ser Cya Ser Ser Ser Leu Ser Ala Leu Ser Leu 
1500 1505 1510 

CAT GAC CCA TTT ATA CAG AAA GAT CTC CAA TTA ACA ATA ATC CCT CCA 4614 
Aap Glu Pro Phe He Cln Lya Aap Val Clu Leu Arg He Met Pro Pro 
1515 1520 1525 

CTT CAG CAA AAT GAC AAT CCC AAT GAA ACA CAA TCA GAG CAG CCT AAA 4662 
Val Gin Glu Aan Aap Aan Gly Aan Glu Thr Glu Ser Glu Cln Pro Lya 
1530 1535 1540 

GAA TCA AAT GAA AAC CAA CAG AAA GAG CCA CAA AAA ACT ATT CAT TCT 4710 
Glu Ser Aan Glu Aan Cln Clu Lya Glu Ala Glu Lya Thr He Aap Ser 
1545 1550 1555 

CAA AAC GAC CTA TTA GAT GAT TCA CAT CAT CAT CAT ATT CAA ATA CTA 4758 
Glu Lya Aap Leu Leu Aap Aap Ser Aap Aap Aap Aap He Clu He Leu 

1560 1565 1570 1575 

GAA GAA TCT ATT ATT TCT CCC ATC CCA ACA AAC TCA TCA CGT AAA CCC 4806 
Clu Clu Cya He He Ser Ala Met Pro Thr Lya Ser Ser Arg Lya Cly 
1580 1585 1590 

AAA AAC CCA CCC CAG ACT CCT TCA AAA TTA CCT CCA CCT CTC CCA ACC 4854* 
Lya LVa Pro Ala Gin Thr Ala Ser Lya Leu Pro Pro Pro Val Ala Arg 
1595 1600 1605 

AAA CCA ACT CAG CTC CCT CTC TAC AAA CTT CTA CCA TCA CAA AAC ACC 4902 
Lya Pro Ser Cln Leu Pro yal Tyr Lya Leu Leu Pro Ser Cln Aan Arg 
1610 1615 1620 

TTC CAA CCC CAA AAC CAT CTT ACT TTT ACA CCG CCC CAT CAT ATC CCA 4950 
Leu Gin Pro Gin Lys Hia Val Ser Phe Thr Pro Gly Asp Aap Met Pro 
1625 1630 1635 
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CCC CTC TAT TGT CTT CAA GCG ACA CCT ATA AAC TTT TCC ACA CCT ACA 4998 
xra V*l Tvr Cv" Val Clu Cly Thr Pro lie A«n Phe S«r Thr Ala Thr 
1640 1645 1650 1655 

TCT CTA ACT CAT OTA ACA ATC CAA TCC CCT CCA AAT GAG TTA GCT GCT 5046 
ser Leu Ser Aap Lou Thr lie Clu Ser Pro Pro Aen Glu Leu Ala Ala 
1660 1665 1670 

GCA GAA CGA CTT ACA CCA CCA CCA CAG TCA GCT CAA TTT CAA AAA CCA 5094 
ClY Clu Cly Val Arg Cly Cly Ala Gin Ser Cly Clu Phe Clu Lye Arg 
1675 1680 1685 

GAT ACC ATT CCT ACA CAA CGC ACA ACT ACA GAT GAG GCT CAA CGA GCA 5142 
A0P Thr He Pro Thr Clu Gly Arg Ser Thr A«p Clu Ala Gin Gly Cly 
1690 1695 1700 

AAA ACC TCA TCT GTA ACC ATA CCT GAA TTG GAT GAC AAT AAA CCA GAG 5190 
Lye Thr Ser Ser Val Thr He Pro Glu Leu Asp Asp Asn Lys Ala Glu 
1705 1710 1715 

CAA CCT GAT ATT CTT GCA GAA TGC ATT AAT TCT CCT ATC CCC AAA CGC 5238 
Glu Gly Asp He Leu Ala Clu Cys He Asn Ser Ala Met Pro Lys Gly 
1720 1725 1730 1735 

AAA ACT CAC AAC CCT TTC CCT CTC AAA AAC ATA ATG GAC CAG CTC CAG 5286 
Lys Ser His Lys Pro Phe Arg Val Lys Lys He Met Asp Gin Val Gin 
1740 1745 1750 

CAA CCA TCT CCC TCG TCT TCT GCA CCC AAC AAA AAT CAG TTA GAT GGT 5334 
Cln Ala Ser Ala Ser Ser Ser Ala Pro Asn Lys Asn Gin Leu Asp Cly 
1755 1760 1765 

AAC AAA AAC AAA CCA ACT TCA CCA CTA AAA CCT ATA CCA CAA AAT ACT 5382 
Lys Lys Lys Lys Pro Thr Ser Pro Val Lys Pro He Pro Gin Asn Thr 
1770 1775 1780 

CAA TAT AGG ACA CCT GTA AGA AAA AAT GCA GAC TCA AAA AAT AAT TTA 5430 
Clu Tyr Arg Thr Arg Val Arg Lys Asn Ala Asp Ser Lys Asn Asn Leu 
1785 1790 1795 

AAT GCT CAG AGA CTT TTC TCA CAC AAC AAA CAT TCA AAC AAA CAG AAT 5478 
Asn Ala Glu Arg Val Phe Ser Asp Asn Lys Asp Ser Lys Lys Cln Asn 
1800 1805 1810 1815 

TTC AAA AAT AAT TCC AAC GAC TTC AAT GAT AAG CTC CCA AAT AAT GAA 5526 
Leu Lys Asn Asn Ser Lys Asp Phe Asn Asp Lys Leu Pro Asn Asn Glu 

1820 1825 1830 

CAT ACA CTC AGA CGA ACT TTT GCT TTT CAT TCA CCT CAT CAT TAG ACC 5574 
Asp Arg Val Arg Gly Ser Phe Ala Phe Asp Ser Pro His His Tyr Thr 
1835 1840 1845 

CCT ATT GAA CGA ACT CCT TAC TCT TTT TCA CGA AAT GAT TCT TTG ACT 5622 
Pro He Glu Gly Thr Pro Tyr Cys Phe Ser Arg Asn Asp Ser Leu Ser 
1850 1855 1860 

TCT CTA CAT TTT CAT CAT CAT CAT CTT GAC CTT TCC ACC GAA AAG GCT 5670 
Ser Leu Asp Phe Asp Asp Asp Asp Val Asp Leu Ser Arg Clu Lys Ala 
1865 1870 1875 
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CAA TTA AGA AAC GCA AAA CAA AAT AAG CAA TCA GAG GCT AAA CTT ACC 5718 
Glu Leu Arg Lya Ala Lys Glu Asn Vym Glu Ser Glu AIa Lya V«l Thr 
1880 1885 1890 1895 

ACC CAC ACA GAA CTA ACC TCC AAC CAA CAA TCA OCT AAT AAG ACA CAA 5766 
Ser His Thr Glu Leu Thr Ser Asn Gin Gin Ser Ala Asn Lye Thr Gin 
1900 1905 1910 

GCT ATT CCA AAG CAC CCA ATA AAT CGA CGT CAC CCT AAA CCC ATA CTT 5814 
Ala lie Ala Lys Gin Pro lie Asn Arg Gly Gin Pro Lya Pro lie Leu 
1915 1920 1925 

CAC AAA CAA TCC ACT TTT CCC CAC TCA TCC AAA CAC ATA CCA CAC ACA 5862 
Gin Lya Gin Ser Thr Phe Pro Gin Ser Ser Lye Asp He Pro Aap Arg 
1930 1935 1940 

CCC GCA GCA ACT CAT CAA AAC TTA CAC AAT TTT CCT ATT CAA AAT ACT 5910 
Gly Ala Ala Thr Asp Glu Lys Leu Gin Asn Phe Ala lie Glu Asn Thr 
1945 1950 1955 

CCA GTT TCC TTT TCT CAT AAT TCC TCT CTC ACT TCT CTC ACT CAC ATT 5958 
Pro Val Cys Phe Ser His Asn Ser Ser Leu Ser Ser Leu Ser Asp He 
1960 1965 1970 1975 

CAC CAA GAA AAC AAC AAT AAA CAA AAT CAA CCT ATC AAA CAC ACT CAC 6006 
Aap Gin Glu Asn Asn Asn Lys Glu Asn Glu Pro lie Lys Glu Thr Glu 
1980 1985 1990 

CCC CCT CAC TCA CAC GCA CAA CCA ACT AAA CCT CAA CCA TCA CCC TAT 6054 
Pro Pro Asp Ser Gin Gly Glu Pro Ser Lys Pro Gin Ala Ser Gly Tyr 
1995 2000 2005 

CCT CCT AAA TCA TTT CAT CTT CAA CAT ACC CCA CTT TCT TTC TCA ACA 6102 
Ala Pro Lys Ser Phe His Val Glu Asp Thr Pro Val Cys Phe Ser Arg 
2010 2015 2020 

AAC ACT TCT CTC ACT TCT CTT ACT ATT CAC TCT CAA CAT CAC CTC TTC 6150 
Asn Ser Ser Leu Ser Ser Leu Ser He Asp Ser Glu Asp Asp Leu Leu 
2025 2030 2035 

CAC CAA TCT ATA ACC TCC GCA ATC CCA AAA AAG AAA AAG CCT TCA AGA 6198 
Cln Glu Cys He Ser Ser Ala Met Pro Lys Lys Lys Lys Pro Ser Arg 
2040 2045 2050 2055 

CTC AAG GCT CAT AAT CAA AAA CAT ACT CCC ACA AAT ATC CCT CCC ATA 6246 
Leu Lys Gly Asp Asn Glu Lys His Ser Pro Arg Asn Met Gly Gly He 
2060 2065 2070 

TTA GCT CAA CAT CTC ACA CTT GAT TTC AAA CAT ATA CAC AGA CCA CAT 6294 
Leu Gly Glu Asp Leu Thr Leu Asp Leu Lys Asp He Cln Arg Pro Asp 
2075 2080 2085 

TCA CAA CAT CCT CTA TCC CCT CAT TCA CAA AAT TTT CAT TCC AAA CCT 6342 
Ser Glu His Gly Leu Ser Pro Asp Ser Glu Asn Phe Asp Trp Lys Ala 
2090 2095 2100 

ATT CAG GAA CGT CCA AAT TCC ATA CTA ACT ACT TTA CAT CAA GCT GCT 6390 
He Cln Glu Gly Ala Asn Ser He Val Ser Ser Leu His Cln Ala Ala 
2105 2110 2115 
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CCT OCT CCA TCT TTA TCT AGA CAA OCT TCC TCT CAT TCA CAT TOO ATC 
Ala Ala Ala Cye Leu ser Arg Cln Ala Sar Ser Aap Ser hmp Sor Ilo 
2120 2125 2130 2135 

CTT TCC CTG AAA TCA CCA ATC TCT CTC CCA TCA CCA TTT CAT CTT ACA 
Leu Ser Leu Lys Ser Gly lie Ser Leu Cly Ser Pro Phe Hi« Leu Thr 
2140 2145 2150 



6438 



6486 



CCT GAT CAA CAA CAA AAA CCC TTT ACA ACT AAT AAA GCC CCA CCA ATT 
Pro Asp Gin Clu Glu Lya Pro Phe Thr Ser Aan Lye Cly Pro Arg lie 
2155 2160 2165 



6534 



CTA AAA CCA CCC GAG AAA ACT ACA TTG CAA ACT AAA AAG ATA CAA TCT 6582 
Leu Lye Pro Cly Glu Lys Ser Thr Leu Clu Thr Lys Lys Zle Glu Ser 
2170 2175 2180 

GAA ACT AAA CCA ATC AAA CCA CCA AAA AAA CTT TAT AAA ACT TTG ATT 6630 
Glu Ser Lys Cly lie Lys Cly Cly Lys Lys Val Tyr Lys Ser Leu lie 
2185 2190 2195 

ACT CCA AAA CTT CGA TCT AAT TCA CAA ATT TCA CCC CAA ATC AAA CAG 6678 
Thr Gly Lys Val Arg Ser Asn Ser Glu Zle Ser Gly Gin Met Lys Gin 
2200 2205 2210 2215 

CCC CTT CAA CCA AAC ATC CCT TCA ATC TCT CCA CCC ACC ACA ATC ATT 6726 
Pro Leu Cln Ala Asn Met Pro Ser He Ser Arg Gly Arg Thr Met He 
2220 2225 2230 



CAT ATT CCA CCA CTT CGA AAT ACC TCC TCA ACT ACA ACT CCT CTT TCT 
His He Pro Cly Val Arg Asn Ser Ser Ser Ser Thr Ser Pro Val Ser 
2235 2240 2245 



6774 



AAA AAA CCC CCA CCC CTT AAC ACT CCA GCC TCC AAA ACC CCT ACT GAA 
Lys Lys Gly Pro Pro Leu Lys Thr Pro Ala Ser Lys Ser Pro Ser Clu 
2250 2255 2260 



6822 



CCT CAA ACA GCC ACC ACT TCT CCT ACA CCA CCC AAC CCA TCT GTG AAA 
Gly Cln Thr Ala Thr Thr Ser Pro Arg Cly Ala Lys Pro Ser Val Lys 
2265 2270 2275 



6870 



TCA GAA TTA ACC CCT CTT GCC AGG CAG ACA TCC CAA ATA CCT GGG TCA 
Ser Glu Leu Ser Pro Val Ala Arg Cln Thr Ser Cln He Gly Cly Ser 
2280 2285 2290 2295 



6918 



ACT AAA CCA CCT TCT ACA TCA CGA TCT AGA CAT TCC ACC CCT TCA ACA 6966 

Ser Lys Ala Pro Ser Arg Ser Gly Ser Arg Asp Ser Thr Pro Ser Arg 
2300 2305 2310 

CCT CCC CAG CAA CCA TTA ACT AGA CCT ATA CAG TCT CCT CCC CCA AAC 7014 

Pro Ala Cln Cln Pro Leu Ser Arg Pro He Cln Ser Pro Cly Arg Asn 
2315 2320 2325 

TCA ATT TCC CCT CGT AGA AAT CCA ATA ACT CCT CCT AAC AAA TTA TCT 7062 

Ser He Ser Pro Gly Arg Asn Gly He Ser Pro Pro Asn Lys Leu Ser 
2330 2335 2340 

CAA CTT CCA AGG ACA TCA TCC CCT ACT ACT CCT TCA ACT AAC TCC TCA 7110 

Gin Leu Pro Arg Thr Ser Ser Pro Ser Thr Ala Ser Thr Lys Ser Ser 

2345 2350 2355 
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GGT TCT CCA AAA ATC TCA TAT ACA TCT CCA CCT AGA CAC ATC AGC CAA 7158 
Cly Ser Cly Ly« Mot Ser Tyr Thr Smr Pro Cly Arg Gin M«t S«r Gin 
2360 2365 2370 2375 

CAG AAC CTT ACC AAA CAA ACA GGT TTA TCC AAG AAT GCC ACT ACT ATT 7206 
Gin Asn Leu Thr Lya Gin Thr Gly Leu Smr Lyu Aan Ala Ser Ser lie 
2380 2385 2390 

CCA AGA ACT CAG TCT GCC TCC AAA GCA CTA AAT CAG ATG AAT AAT GGT 72 54 

Pro Arg Ser Glu Ser Ala Ser Lya Gly Leu Aen Gin Met Asn Asn Gly 
2395 2400 2405 

AAT GGA GCC AAT AAA AAG GTA GAA CTT TCT AGA ATG TCT TCA ACT AAA 7302 
Aen Gly Ala Aen Lye Lye Val Glu Leu Ser Arg Met Ser Ser Thr Lya 
2410 2415 2420 

TCA ACT GGA ACT GAA TCT CAT AGA TCA GAA AGA CCT GTA TTA GTA CGC 7350 
Ser Ser Gly Ser Glu Ser Aap Arg Ser Glu Arg Pro Val Leu Val Arg 
2425 2430 2435 

CAG TCA ACT TTC ATC AAA CAA GCT CCA AGC CCA ACC TTA AGA AGA AAA 7398 
Gin Ser Thr Phe lie Lya Glu Ala Pro Ser Pro Thr Leu Arg Arg Lya 
2440 2445 2450 2455 

TTC GAG CAA TCT GCT TCA TTT GAA TCT CTT TCT CCA TCA TCT AGA CCA 7446 
Leu Glu Glu Ser Ala Ser Phe Glu Ser Leu Ser Pro Ser Ser Arg Pro 
2460 2465 2470 

GCT TCT CCC ACT AGG TCC CAG GCA CAA ACT CCA GTT TTA ACT CCT TCC 7494 
Ala Ser Pro Thr Arg Ser Gin Ala Gin Thr Pro Val Leu Ser Pro Ser 
2475 2480 2485 

CTT CCT CAT ATG TCT CTA TCC ACA CAT TCC TCT CTT CAG CCT GGT GGA 7542 
Leu Pro Aap Met Ser Leu Ser Thr Hia Ser Ser Val Gin Ala Gly Gly 
2490 2495 2500 

TCC CCA AAA CTC CCA CCT AAT CTC ACT CCC ACT ATA GAG TAT AAT GAT 7590 
Trp Arg Lya Leu Pro Pro Aan Leu Ser Pro Thr lie Glu Tyr Aan Aap 
2505 2510 2515 

CCA ACA CCA GCA AAG CGC CAT GAT ATT GCA CCC TCT CAT TCT GAA ACT 7638 
Gly Arg Pro Ala Lya Arg Hia Aap lie Ala Arg Ser His Ser Glu Ser 
2520 2525 2530 2535 

CCT TCT AGA CTT CCA ATC AAT ACC TCA GCA ACC TGG AAA CGT GAG CAC 7686 
Pro Ser Arg Leu Pro lie Aan Arg Ser Gly Thr Trp Lya Arg Glu Hia 
2540 2545 2550 

ACC AAA CAT TCA TCA TCC CTT CCT CCA GTA ACC ACT TGG AGA ACA ACT 7734 
Ser Lya Hia Ser Ser Ser Leu Pro Arg Val Ser Thr Trp Arg Arg Thr 
2555 2560 256S 

CCA ACT TCA TCT TCA ATT CTT TCT CCT TCA TCA GAA TCC ACT GAA AAA 7782 
Gly Ser Ser Ser Ser lie Leu Ser Ala Ser Ser Glu Ser Ser Glu Lya 
2570 2575 2580 

GCA AAA ACT GAG GAT GAA AAA CAT CTC AAC TCT ATT TCA GCA ACC AAA 7830 
Ala Lys Ser Glu Asp Glu Lya His Val Aan Ser lie Ser Gly Thr Lys 
2585 2590 2595 
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CAA ACT AAA GAA AAC CAA CTA TCC CCA AAA CCA ACA TCC ACA AAA ATA 7878 

Cln Sar Lya CIu Aan Cln Val Sar Ala Lya Cly Thr Trp Arg Lya I la 

2600 2605 2610 2615 

AAA CAA AAT GAA TTT TCT CCC ACA AAT ACT ACT TCT CAC ACC CTT TCC 7926 
Lya Glu Abo Clu Phe Sar Pro Thr Aan Sar Thr Ser Cln Thr Val Sar 
2620 2625 2M0 

TCA GGT OCT ACA AAT OCT CCT CAA TCA AAC ACT CTA ATT TAT CAA ATC 79 74 

Ser Cly Ala Thr Aan Cly Ala Clu Sar Lya Thr Leu lie Tyr Cln Met 
2635 2640 2645 

CCA CCT CCT CTT TCT AAA ACA CAC CAT CTT TCC CTC ACA ATT CAC CAC 8022 
Ala Pro Ala Val Ser Lya Thr Clu Aap Val Trp Val Arg lie Clu Aap 
2650 2655 2660 

TCT CCC ATT AAC AAT CCT ACA TCT CCA ACA TCT CCC ACA CCT AAT ACT 8070 
Cya Pro lie Aan Aan Pro Arg Ser Cly Arg Ser Pro Thr Cly Aan Thr 
2665 2670 2675 

CCC CCC CTC ATT CAC ACT CTT TCA CAA AAC CCA AAT CCA AAC ATT AAA 8118 
Pro Pro Val lie Aap Ser Val Ser Glu Lya Ala Aan Pro Asn He Lya 
2680 2685 2690 2695 

CAT TCA AAA CAT AAT CAC CCA AAA CAA AAT CTC CCT AAT CCC ACT CTT 8166 
Aap Ser Lya Aap Aen Cln Ala Lya Cln Aan Val Cly Aen Cly Ser Val 

2700 2705 2710 

CCC ATC CCT ACC CTC CGT TTC CAA AAT CCC CTC ACC TCC TTT ATT CAC 8214 
Pro Met Arg Thr Val Cly Leu Clu Aan Arg Leu Thr Sar Phe lie Cln 
2715 2720 2725 

GTC GAT CCC CCT CAC CAA AAA CCA ACT CAC ATA AAA CCA CCA CAA AAT 8262 
Val Aap Ala Pro Aap Gin Lya Cly Thr Glu Ha Lya Pro Cly Cln Aan 
2730 2735 2740 

AAT CCT CTC CCT CTA TCA GAG ACT AAT CAA ACT CCT ATA CTC GAA CCT 8310 
Aan Pro Val Pro Val Ser Glu Thr Aan Glu Sar Pro lie Val Glu Arg 
2745 2750 275S 

ACC CCA TTC ACT TCT ACC ACC TCA ACC AAA CAC ACT TCA CCT ACT CCC 8358 
Thr Pro Phe Ser Sar Ser Ser Ser Ser Lye Hia Ser Ser Pro Ser Cly 
2760 2765 2770 2775 

ACT CTT CCT GCC ACA CTC ACT CCT TTT AAT TAC AAC CCA ACC CCT ACC 8406 
Thr Val Ala Ala Arg Val Thr Pro Phe Aan Tyr Aan Pro Sar Pro Arg 
2780 2785 2790 

AAA ACC ACC CCA GAT ACC ACT TCA CCT CCC CCA TCT CAG ATC CCA ACT 8454 
Lya ser Ser Ala Aap Sar Thr Ser Ala Arg Pro Ser Cln He Pro Thr 
2795 2800 2805 

CCA GTC AAT AAC AAC ACA AAC AAC CCA GAT TCC AAA ACT CAC ACC ACA 8502 
Pro Val Aan Aan Aan Thr Lya Lye Arg Aap Ser Lya Thr Aap Sar Thr 
2810 2815 2820 

CAA TCC ACT CCA ACC CAA ACT CCT AAC CCC CAT TCT GGC TCT TAC CTT 8550 
Clu Ser Ser Cly Thr Gin Ser Pro Lya Arg His Ser Cly Ser Tyr Leu 
2825 2830 2835 
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CTC ACA TCT CTT TAAAAGACAG GAAGAATGAA ACTAAGAAAA TTCTATCTTA 8602 

Val Thr Scr Val 

2840 

ATTACAACTG CTATATAGAC ATTTTGTTTC AAATGAAACT TTAAAAGACT CAAAAATTTT 8662 

GTAAATAGGT TTGATTCTTG TTAGAGGCTT TTTGTTCTCG AAGCCATATT TGATAGTATA 8722 

CTTTGTCTTC ACTCGTCTTA TTTTGGGAGG CACTCTTGAT CGTTACGAAA AAATAGAAAG 8782 

CCAAGTATGT TTGTACAGTA TGTTTTACAT GTATTTAAAC TAGCATCCCA TCCCAACTTC 8842 

CTTAATTATT CCTTGTCTAA AATAATCAAC ACTACAGATA GGAAATATGA TATATTCCTG 8902 

TTATCAATCA TTTCTAGATT ATAAACTGAC TAAACTTACA TCAGGGGAAA ATTCGTATTT 8962 

ATGCAAAAAA AAAATGTTTT TGTCCTTGTG AGTCCATCTA ACATCATAAT TAATCATGTG 9022 

GCTCTCAAAT TCACAGTAAT ATGGTTCCCG ATCAACAAGT TTACCCACCC TCCTTTGCTT 9082 

ACTCCATGAA TCAAACTGAT GGTTCAATTT CAGAAGTAAT GATTAACACT TATGTCGTCA 9142 

CATGATGTGC ATAGAGATAG CTACACTGTA ATAATTTACA CTATTTTGTG CTCCAAACAA 9202 

AACAAAAATC TGTGTAACTG TAAAACATTG AATCAAACTA TTTTACCTGA ACTACATTTT 9262 

ATCTGAAAGT AGGTAGAATT TTTGCTATGC TGTAATTTGT TGTATATTCT GGTATTTGAG 9322 

GTGACATGGC TGCTCTTTAT TAATGAGACA TGAATTGTGT CTCAACACAA ACTAAATCAA 9382 

CATTTCAGAA TAAATTATTG CTGTATGTAA ACTGTTACTG AAATTCGTAT TTGTTTGAAG 9442 

GGTTTGTTTC ACATTTGTAT TAATTAATTG TTTAAAATGC CTCTTTTAAA AGCTTATATA 9502 

AATTTTTTCT TCAGCTTCTA TGCATTAAGA GTAAAATTCC TCTTACTGTA ATAAAAACAT 9562 

TGAAGAAGAC TGTTGCCACT TAACCATTCC ATGCGTTGGC ACTT 9606 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2843 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Ala Ala Ser Tyr Aap Gin Leu Leu Lye Gin Val Glu Ala Leu 
IS 10 IS 

Lye Met Glu Asn Ser Asn Leu Arg Gin Glu Leu Glu Asp Asn Ser Asn 

20 25 30 

His Leu Thr Lye Leu Glu Thr Glu Ala Ser Asn Met Lys Glu Val Leu 
35 40 45 

Lys Gin Leu Gin Gly Ser He Glu Asp Glu Ala Met Ala Scr Ser Gly 
50 55 60 
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Gln lie Aap Lau L«u Clu Xrg L«u Lys GIu Lmu Asn Lmu Amp smr Smr 

65 70 75 80 

Asn Phe Pro Gly Val Lys Leu Krg Ser Lym Met Ser Leu Arg Ser Tyr 

as 90 95 

Gly Ser Arg Glu Gly Ser Val Ser Ser Arg Ser Gly Glu Cys Ser Pro 
100 105 110 

Val Pro Met Gly Ser Phe Pro Arg Arg Gly Phe Val A«n Gly Ser Arg 
115 120 125 

Glu Ser Thr Gly Tyr Leu Glu Glu Leu Glu Lys Glu Arg Ser Leu Leu 
130 135 140 

Leu Ala Asp Leu Asp Lys Glu Glu Lys Glu Lys Asp Trp Tyr Tyr Ala 
145 ISO 155 160 

Gin Leu Gin Asn Leu Thr Lys Arg lie Asp Ser Leu Pro Leu Thr Glu 
165 170 175 

Asn Phe Ser Leu Gin Thr Asp Leu Thr Arg Arg Gin Leu Glu Tyr Glu 
180 185 190 

Ala Arg Gin lie Arg Val Ala Met Glu Glu Gin Leu Gly Thr Cys Gin 

195 200 205 

Asp Met Glu Lys Arg Ala Gin Arg Arg lie Ala Arg lie Gin Gin tie 
210 215 220 

Glu Lys Asp lie Leu Arg lie Arg Gin Leu Leu Gin Ser Gin Ala Thr 
225 230 235 240 

Glu Ala Glu Arg Ser Ser Gin Asn Lys His Glu Thr Gly Ser His Asp 
245 250 255 

Ala Glu Arg Gin Asn Glu Gly Gin Gly Val Gly Glu He Asn Met Ala 
260 265 270 

Thr Ser Gly Asn Gly Gin Gly Ser Thr Thr Arg Met Asp His Glu Thr 
275 260 265 

Ala Ser Val Leu Ser Ser Ser Ser Thr His Ser Ala Pro Arg Arg Leu 

290 295 300 

Thr Ser Hie Leu Gly Thr Lys Val Glu Met Val Tyr Ser Leu Leu Ser 

305 310 315 320 

Met Leu Gly Thr His Asp Lys Asp Asp Met Ser Arg Thr Leu Leu Ala 

325 330 335 

Met Ser Ser Ser Gin Asp Ser Cys He Ser Met Arg Gin Ser Gly Cys 
340 345 350 

Leu Pro Leu Leu He Gin Leu Leu His Gly Asn Asp Lys Asp Ser Val 
355 360 365 

Leu Leu Gly Asn Ser Arg Gly Ser Lys Glu Ala Arg Ala Arg Ala Ser 
370 375 380 

Ala Ala Leu His Asn He He His Ser Gin Pro Asp Asp Lys Arg Gly 
385 390 395 400 
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Arg Ara Glu Zl« Arg Val Lmxx His hmu Lmu Glu Gin He Arg Ala Tyr 
405 410 415 

Cyo Glu Thr Cym Trp Glu Trp Gin Glu Ala Hia Glu Pro Gly Met Aap 
420 425 430 

Gin Aap Lya Aan Pro Met Pro Ala Pro Val Glu His Gin lie Cys Pro 
435 440 445 

Ala Val Cya Val Leu Met Lya Leu Ser Phe Aap Glu Glu His Arg His 
450 455 460 

Ala Met Aan Glu Leu Gly Gly Leu Gin Ala lie Ala Glu X^u Leu Gin 
465 470 475 480 

Val Aap Cys Glu Met Tyr Gly Leu Thr Asn Asp Hia Tyr Ser lie Thr 
465 490 495 

Leu Arg Arg Tyr Ala Gly Met Ala Leu Thr Asn Leu Thr Phe Gly Asp 
500 505 510 

Val Ala Aan Lys Ala Thr Leu Cys Ser Met Lys Gly Cys Met Arg Ala 

515 520 525 

Leu Val Ala Gin Leu Lys Ser Glu Ser Glu Asp Leu Gin Gin Val He 

530 535 540 

Ala Ser Val Leu Arg Asn Leu Ser Trp Arg Ala Aap Val Aan Ser Lys 
545 550 555 560 

Lys Thr Leu Arg Glu Val Gly Ser Val Lya Ala Leu Met Glu Cys Ala 

565 570 575 

Leu Glu Val Lya Lya Glu Ser Thr Leu Lya Ser Val Leu Ser Ala Leu 

580 585 590 

Trp Asn Leu Ser Ala Hia Cys Thr Glu Asn Lys Ala Aap He Cys Ala 

595 600 605 

Val Asp Gly Ala Leu Ala Phe Leu Val Gly Thr Leu Thr Tyr Arg Ser 
610 615 620 

Gin Thr Aan Thr Leu Ala He He Glu Ser Gly Gly Gly He Leu Arg 
625 630 635 640 

Aan Val Ser Ser Leu He Ala Thr Aan Glu Aap His Arg Gin He Leu 

645 650 655 

Arg Glu Asn Asn Cys Lau Gin Thr Leu Leu Gin His Leu Lys Ser His 

660 665 670 

Ser Leu Thr He Val Ser Aan Ala Cya Gly Thr Leu Trp Aan Leu Ser 

675 680 685 

Ala Arg Asn Pro Lya Aap Gin Glu Ala Leu Trp Aap Met Gly Ala Val 
690 695 700 

Ser Met Leu Lys Asn Leu He His Ser Lys His Lys Met He Ala Met 
705 710 715 720 

Gly Ser Ala Ala Ala Leu Arg Aan Leu Met Ala Asn Arg Pro Ala Lys 
725 730 735 
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Tyr Ly« Amp Ala Ann lie Met Ser Pro Cly Ser Ser Leu Pro Ser Leu 
740 745 750 

Hi8 Val Arg Lye Gin Lys Ala Leu Glu Ala Glu Leu Aap Ala Gin His 

755 760 765 

Leu Ser Glu Thr Phe Aap Asn lie Asp Asn Leu Ser Pro Lys Ala Ser 
770 775 780 

His Arg Ser Lys Gin Arg His Lys Gin Ser Leu Tyr Gly Asp Tyr Val 
785 790 795 ^ i 

Phe Asp Thr Asn Arg His Asp Asp Asn Arg Ser Asp Asn Phe Asn Thr 

305 810 815 

Gly Asn Met Thr Val Leu Ser Pro Tyr Leu Asn Thr Thr Val Leu Pro 
820 825 830 

ser Ser Ser Ser Ser Arg Gly Ser Leu Asp Ser Ser Arg Ser Glu Lvs 
835 840 845 

Asp Arg Ser Leu Glu Arg Glu Arg Gly He Gly Leu Gly Asn Tyr His 
850 855 860 

Pro Ala Thr Glu Asn Pro Gly Thr Ser Ser Lys Arg Gly Leu Gin He 
865 870 875 880 

Ser Thr Thr Ala Ala Gin He Ala Lys Val Met Glu Glu Val Ser Ala 
885 890 895 

He His Thr Ser Gin Glu Asp Arg Ser Ser Gly Ser Thr Thr Glu Leu 
900 905 910 

His Cys Val Thr Asp Glu Arg Asn Ala Leu Arg Arg Ser Ser Ala Ala 

915 920 925 

His Thr His Ser Asn Thr Tyr Asn Phe Thr Lys Ser Glu Asn Ser Asn 
930 935 940 

Arg Thr Cys Ser Met Pro Tyr Ala Lys Leu Glu Tyr Lys Arg Ser Ser 
945 950 955 960 

Asn Asp Ser Leu Asn Ser Val Ser Ser Asn Asp Gly Tyr Gly Lys Arg 

965 970 975 

Cly Gin Met Lys Pro Ser He Glu Ser Tyr Ser Glu Asp Asp Glu Ser 
980 985 990 

Lys Phe Cys Ser Tyr Gly Gin Tyr Pro Ala Asp Leu Ala His Lys He 
995 1000 1005 

His Ser Ala Asn His Met Asp Asp Asn Asp Cly Glu Leu Asp Thr Pro 
1010 1015 1020 

He Asn Tyr Ser Leu Lys Tyr Ser Asp Glu Gin Leu Asn Ser Gly Ara 
1025 1030 1035 1040 

Gin Ser Pro Ser Gin Asn Glu Arg Trp Ala Arg Pro Lya His He He 
1045 1050 1055 - 

Glu Asp Glu He Lys Gin Ser Glu Gin Arg Gin Ser Arg Asn Gin Ser 
1060 1065 1070 
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Thr Thr Tyr Pro Val Tyr Thr Glu Ser Thr A«p hmp Ly» Hi« Leu Ly« 
1075 1080 1085 

Phc Gin Pro Hie Phe Gly Gin Gin Glu Cy« Val Ser Pro Tyr Arg Ser 
1090 1095 1100 

Arq Gly Ala Aan Gly Ser Glu Thr A«n Arg Val Gly Ser Asn Hia Gly 
1105 1110 1115 1120 

lie Aan Gin Aan Val Ser Gin Ser Leu Cya Gin Glu Aap Aap Tyr Glu 
1125 1130 1135 

Aap Aap Lya Pro Thr Aan Tyr Ser Glu Arg Tyr Ser Glu Glu Glu Gin 
1140 1145 1150 

Hia Glu Glu Glu Glu Arg Pro Thr Aan Tyr Ser lie Lya Tyr Aan Glu 
1155 1160 1165 

Glu Lya Arg Hia Val Aap Gin Pro lie Aap Tyr Ser Leu Lya Tyr Ala 
1170 1175 1180 

Thr Aap lie Pro Ser Ser Gin Lya Gin Ser Phe Ser Phe Ser Lya Ser 
1185 1190 1195 1200 

Ser Ser Gly Gin Ser Ser Lya Thr Glu Hia Met Ser Ser Ser Ser Glu 
1205 1210 1215 

Aan Thr Ser Thr Pro Ser Ser Aan Ala Lya Arg Gin Aan Gin Leu Hia 
1220 1225 1230 

Pro Ser Ser Ala Gin Ser Arg Ser Gly Gin Pro Gin Lya Ala Ala Thr 
1235 1240 1245 

Cya Lya Val Ser Ser lie Aan Gin Glu Thr He Gin Thr Tyr Cya Val 
1250 1255 1260 

Glu Aap Thr Pro He Cya Phe Ser Arg Cya Ser Ser Leu Ser Ser Leu 
1265 1270 1275 1280 

Ser Ser Ala Glu Aap Glu He Gly Cya Aan Gin Thr Thr Gin Glu Ala 
1285 1290 1295 

Aap Ser Ala Aan Thr Leu Gin He Ala Glu He Lya Gly Lya He Gly 
1300 1305 1310 

Thr Arg Ser Ala Glu Aap Pro Val Ser Glu Val Pro Ala Val Ser Gin 

1315 1320 1325 

Hia Pro Arg Thr Lya Ser Ser Arg Leu Gin Gly Ser Ser Leu Ser Ser 
1330 1335 1340 

Glu Ser Ala Arg Hia Lya Ala Val Glu Phe Pro Ser Gly Ala Lya Ser 
1345 1350 1355 1360 

Pro Ser Lya Ser Gly Ala Gin Thr Pro Lya Ser Pro Pro Glu Hia Tyr 
1365 1370 1375 

Val Gin Glu Thr Pro Leu Met Phe Ser Arg Cya Thr Ser Val Ser Ser 
1380 1385 1390 

Leu Asp Ser Phe Glu Ser Arg Ser He Ala Ser Sar Val Gin Ser Glu 
1395 1400 1405 
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Pro Cys Ser Gly Het Val S«r Gly He Il« Smr Pro Smr Amp L«u Pro 
1410 1415 1420 

Asp Ser Pro Gly Gin Thr Het Pro Pro Ser Arg Ser Lye Thr Pro Pro 
1425 1430 1435 1440 

Pro Pro Pro Gin Thr Ala Gin Thr Lye Arg Clu Val Pro Lye Aen Lye 
1445 1450 1455 

Ala Pro Thr Ala Clu Lye Arg Glu Ser Gly Pro Ly« Gin Ala Ala Val 
1460 1465 1470 

Aen Ala Ala Val Gin Arg Val Gin Val Leu Pro Asp Ala Aap Thr Leu 
1475 1480 1485 

Leu Hia Phe Ala Thr Glu Ser Thr Pro Asp Gly Phe Ser Cys Ser Ser 
1490 1495 1500 

Ser Leu Ser Ala Leu Ser Leu Asp Glu Pro Phe He Gin Lys Aep Val 
1505 1510 1515 1520 

Glu Leu Arg He Met Pro Pro Val Gin Glu Asn Aap Asn Gly Asn Glu 
1525 1530 1535 

Thr Glu Ser Glu Gin Pro Lys Glu Ser Asn Glu Asn Gin Glu Lys Glu 
1540 1545 1550 

Ala Glu Lys Thr He Asp Ser Glu Lys Asp Leu Leu Asp Asp Ser Asp 

1555 1560 1565 

Aep Asp Asp He Glu He Leu Glu Glu Cys He He Ser Ala Met Pro 
1570 1575 1580 

Thr Lys Ser Ser Arg Lye Gly Lys Lys Pro Ala Cln Thr Ala Ser Lys 
1585 1590 1595 1600 

Leu Pro Pro Pro Val Ala Arg Lys Pro Ser Gin Leu Pro Val Tyr Lys 
1605 1610 1615 

Leu Leu Pro Ser Gin Asn Arg Leu Gin Pro Gin Lya His Val Ser Phe 
1620 1625 1630 

Thr Pro Gly Aap Aep Met Pro Arg Val Tyr Cys Val Glu Gly Thr Pro 
1635 1640 1645 

He Asn Phe Ser Thr Ala Thr Ser Leu Ser Asp Leu Thr He Glu Ser 
1650 1655 1660 

Pro Pro Asn Glu Leu Ala Ala Gly Glu Gly Val Arg Gly Gly Ala Gin 
1665 1670 1675 1680 

Ser Gly Glu Phe Glu Lys Arg Asp Thr He Pro Thr Glu Gly Arg Ser 
1685 1690 1695 

Thr Asp Glu Ala Gin Gly Gly Lys Thr Ser Ser Val Thr He Pro Glu 
1700 1705 1710 

Leu Aap Asp Asn Lys Ala Glu Glu Gly Asp He Leu Ala Glu Cys He 
1715 1720 1725 

Asn Ser Ala Met Pro Lys Gly Lys Ser His Lys Pro Phe Arg Val Lys 
1730 1735 1740 
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Lys lie Met Aep Gin Val Gin Gin AlA Ser Ala Ser Ser S«r Ala Pro 
1745 1750 1755 1760 

Aon Lys Asn Gin Leu Asp Gly Lys Lys Lya Ly« Pro Thr Ser Pro Val 
1765 1770 1775 

Lya Pro lie Pro Gin Aan Thr Glu Tyr Arg Thr Arg Val Arg Lys Aan 
1780 1785 1790 

Ala Asp Ser Lys Asn Asn Leu Asn Ala Glu Arg Val Phe Ser Asp Asn 
1795 1800 1805 

Lys Asp Ser Lys Lys Gin Asn Leu Lys Asn Asn Ser Lys Asp Phe Asn 
1810 1815 1820 

Asp Lys Leu Pro Asn Asn Glu Asp Arg Val Arg Gly Ser Phe Ala Phe 
1825 1830 1835 1840 

Asp Ser Pro His His Tyr Thr Pro lie Glu Gly Thr Pro Tyr Cys Phe 
1845 1850 1855 

Ser Arg Asn Asp Ser Leu Ser Ser Leu Asp Phe Asp Asp Asp Asp Val 
1860 1865 1870 

Asp Leu Ser Arg Glu Lys Ala Glu Leu Arg Lys Ala Lys Glu Asn Lys 
1875 1880 1885 

Glu Ser Glu Ala Lys Val Thr Ser His Thr Glu Leu Thr Ser Asn Gin 
1890 1895 1900 

Gin Ser Ala Asn Lys Thr Gin Ala lie Ala Lys Gin Pro lie Asn Arg 
1905 1910 1915 1920 

Gly Gin Pro Lys Pro lie Leu Gin Lys Gin Ser Thr Phe Pro Gin Ser 
1925 1930 1935 

Ser Lys Asp lie Pro Asp Arg Gly Ala Ala Thr Asp Glu Lys Leu Gin 
1940 1945 1950 

Asn Phe Ala lie Glu Asn Thr Pro Val Cys Phe Ser His Asn Ser Ser 
1955 1960 1965 

Leu Ser Ser Leu Ser Asp lie Asp Gin Glu Asn Asn Asn Lys Glu Asn 
1970 1975 1980 

Glu Pro He Lys Glu Thr Glu Pro Pro Asp Ser Gin Gly Glu Pro Ser 
1985 1990 1995 2000 

Lys Pro Gin Ala Ser Gly Tyr Ala Pro Lys Ser Phe His Val Glu Asp 
2005 2010 2015 

Thr Pro Val Cys Phe Ser Arg Asn Ser Ser Leu Ser Ser Leu Ser He 
2020 2025 2030 

Asp Ser Glu Asp Asp Leu Leu Gin Glu Cys He Ser Ser Ala Met Pro 
2035 2040 2045 

Lye Lys Lys Lys Pro Ser Arg Leu Lys Gly Asp Asn Glu Lys His Ser 
2050 2055 2060 

Pro Arg Asn Met Gly Gly He Leu Gly Glu Asp Leu Thr Leu Asp Leu 
2065 2070 2075 2080 
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Lya Asp lie Gin Xrg Pro Asp Ser Glu Mis Gly Leu Ssr Pro Asp Ssr 
208S 2090 2095 

Glu Asn Phe Asp Trp Lys Ala lie Gin Glu Gly Ala Asn Ser lie Val 
2100 2105 2110 

Ser Ser Leu His Cln Ala Ala Ala Ala Ala Cys Leu Ser Arg Gin Ala 
2115 2120 2125 

Ser Ser Asp Ser Asp Ser lie Leu Ser Leu Lys Ser Gly lie Ser Leu 
2130 2135 2140 

Gly Ser Pro Phe His Leu Thr Pro Asp Gin Glu Glu Lys Pro Phe Thr 
2145 2150 2155 2160 

Ser Asn Lys Gly Pro Arg lie Leu Lys Pro Gly Glu Lys Ser Thr Leu 
2165 2170 2175 

Glu Thr Lys Lys lie Glu Ser Glu Ser Lys Gly lie Lys Gly Gly Lys 
2180 2185 2190 

Lys Val Tyr Lys Ser Leu lie Thr Gly Lys Val Arg Ser Asn Ser Glu 
2195 2200 2205 

lie Ser Gly Gin Met Lys Gin Pro Leu Gin Ala Asn Met Pro Ser lie 
2210 2215 2220 

Ser Arg Gly Arg Thr Met lie His lie Pro Gly Val Arg Asn Ser Ser 
2225 2230 2235 2240 

Ser Ser Thr Ser Pro Val Ser Lys Lys Gly Pro Pro Leu Lys Thr Pro 
2245 2250 2255 

Ala Ser Lys Ser Pro Ser Glu Gly Gin Thr Ala Thr Thr Ser Pro Arg 
2260 2265 2270 

Gly Ala Lys Pro Ser Val Lys Ser Glu Leu Ser Pro Val Ala Arg Gin 
2275 2280 2285 

Thr Ser Gin lie Gly Gly Ser Ser Lys Ala Pro Ser Arg Ser Gly Ser 
2290 2295 2300 

Arg Asp Ser Thr Pro Ser Arg Pro Ala Gin Gin Pro Leu Ser Arg Pro 
2305 2310 2315 2320 

He Gin Ser Pro Gly Arg Asn Ser tie Ser Pro Gly Arg Asn Gly He 

2325 2330 2335 

Ser Pro Pro Asn Lys Leu Ser Gin Leu Pro Arg Thr Ser Ser Pro Ser 
2340 2345 2350 

Thr Ala Ser Thr Lys Ser Ser Gly Ser Gly Lys Met Ser Tyr Thr Ser 
2355 2360 2365 

Pro Gly Arg Gin Met Ser Gin Gin Asn Leu Thr Lys Gin Thr Gly Leu 
2370 2375 2380 

Ser Lys Asn Ala Ser Ser He Pro Arg Ser Glu Ser Ala Ser Lys Gly 
2385 2390 2395 2400 

Leu Asn Gin Met Asn Asn Gly Asn Gly Ala Asn Lys Lys Val Glu Leu 
2405 2410 2415 
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Ser Arg Met Ser Ser Thr Lys Ser Ser Gly S«r CIu Smr A«p Arq Ser 
2420 2425 2430 

Glu Arg Pro Val Leu Val Arg Gin Ser Thr Phe lie Ly« Glu Ala Pro 
2435 2440 2445 

Ser Pro Thr Leu Arg Arg Lys Leu Glu Glu Ser Ala Ser Phe Glu Ser 
2450 2455 2460 

Leu Ser Pro Ser Ser Arg Pro Ala Ser Pro Thr Arg Ser Gin Ala Gin 
2465 2470 2475 2480 

Thr Pro Val Leu Ser Pro Ser Leu Pro Asp Met Ser Leu Ser Thr Hie 
2485 2490 2495 

Ser Ser Val Gin Ala Gly Gly Trp Arg Lye Leu Pro Pro Aen Leu Ser 
2500 2505 2510 

Pro Thr lie Glu Tyr Aan Asp Gly Arg Pro Ala Lys Arg His Asp lie 
2515 2520 2525 

Ala Arg Ser His Ser Glu Ser Pro Ser Arg Leu Pro lie Asn Arg Ser 
2530 2535 2540 

Gly Thr Trp Lys Arg Glu His Sar Lys His Ser Ser Ser Leu Pro Aro 
2545 2550 2555 2560 

Val Ser Thr Trp Arg Arg Thr Gly Ser Ser Ser Ser He Leu Ser Ala 
2565 2570 2575 

Ser Ser Glu Ser Ser Glu Lys Ala Lys Ser Glu Asp Glu Lys His Val 
2580 2585 2590 

Asn Ser Xle Ser Gly Thr Lys Gin Ser Lys Glu Asn Gin Val Ser Ala 
2595 2600 2605 

Lys Gly Thr Trp Arg Lys lie Lys Glu Asn Glu Phe Ser Pro Thr Asn 
2610 2615 2620 

Ser Thr Ser Gin Thr Val Ser Ser Gly Ala Thr Asn Gly Ala Glu Ser 
2625 2630 2635 2640 

Lys Thr Leu He Tyr Gin Met Ala Pro Ala Val Ser Lys Thr Glu Asp 
2645 2650 2655 

Val Trp Val Arg He Glu Asp Cys Pro He Asn Asn Pro Arg Ser Gly 

2660 2665 2670 

Arg Ser Pro Thr Gly Aan Thr Pro Pro Val He Asp Ser Val Ser Glu 
2675 2680 2685 

Lys Ala Asn Pro Asn He Lys Asp Ser Lys Asp Asn Gin Ala Lys Gin 
2690 2695 2700 

Asn Val Gly Asn Gly Ser Val Pro Met Arg Thr Val Gly Leu Glu Asn 
2705 2710 2715 2720 

Arg Leu Thr Ser Phe He Gin Val Asp Ala Pro Asp Gin Lys Gly Thr 
2725 2730 2735 

Glu He Lys Pro Gly Gin Aan Asn Pro Val Pro Val Ser Glu Thr Asn 
2740 2745 2750 
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Clu Ser Pro Il« Val Clu hrg Thr Pro Ph« Ser S«r Smr Smr Ser ser 
2755 2760 2765 

Lve HLm ser Ser Pro Ser Cly Thr V«l Ala Ala Arg Val Thr Pro Phe 
^ 2770 2775 2780 

Aen Tyr Aen Pro Ser Pro Arg Lye Ser Ser Ala Aep Ser Thr Ser Ala 
2785 2790 2795 2800 

Ara Pro ser Cln lie Pro Thr Pro Val Aan Aen Aen Thr Lye Lye Arg 
2805 2810 2815 

A0P Ser Lye Thr Aep Ser Thr Clu Ser Ser Cly Thr Cln Ser Pro Lye 
^ ^ 2820 2825 2830 

Arg His Ser Cly Ser Tyr Leu Val Thr Ser Val 
2835 2840 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3172 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CONA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE? 

(B) CLONE: DP1(TB2) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..630 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CCA GTC CCC OCT CCA CTC TAT CCG CCA CTA CCA ACA CCC CCC CGN GGC 48 

Ala Val Ala Ala Pro VaI Tyr Pro Ala Leu Cly Thr Ala Pro Cly Cly 

1 5 10 IS 

GAG ACG GTC CCC CCC ATG TCT CCG CCC ATC AGG GAG ACC TTC CAC CCG 96 

Glu Thr Val Pro Ala Met Ser Ala Ala Met Arg Glu Arg Phe Asp Arg 

20 25 30 

TTC CTC CAC GAG AAG AAC TGC ATG ACT CAC CTT CTC CCC AAG CTC CAC 144 

Phtf Leu His Glu Lye Asn Cys Met Thr Asp Leu Leu Ala Lys Leu Glu 

3S 40 45 

CCC AAA ACC CCC CTC AAC AGG ACC TTC ATC GCT CTT GOT GTC ATC CCA 192 

Ala Lya Thr Cly Val Asn Arg Ser Phe He Ala Leu Cly Val He Cly 

50 55 60 , 

CTC CTC CCC TTG TAC CTG GTC TTC GCT TAT CCA CCC TCT CTC CTC TGC 240 

Leu val Ala Leu Tyr Leu Val Phe Cly Tyr Cly Ala Ser Leu Leu Cys 

65 70 75 80 
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AAC CTC ATA CCA TTT CCC TAC CCA CCC TAC ATC TCA ATT AAA CCT ATA 
Asn Lou Il« Cly Ph« Cly Tyr Pro Ala Tyr lie Ser lie Lye Ala He 

8S 90 95 



288 



CAC ACT CCC AAC AAA GAA CAT CAT ACC CAG TCC CTC ACC TAC TGC CTA 
Clu Ser Pro Aen Lye Clu Aep Asp Thr Cln Trp Leu Thr Tyr Trp Val 
100 105 110 



336 



CTC TAT CCT CTC TTC ACC ATT CCT GAA TTC TTC TCT CAT ATC TTC CTC 
Val Tyr Gly Val Phe Ser He Ala Glu Phe Phe Ser Aep He Phe Leu 
115 120 125 



384 



TCA TCG TTC CCC TTC TAC TAC ATG CTG AAC TCT CCC TTC CTC TTC TGC 
Ser Trp Phe Pro Phe Tyr Tyr Met Leu Lya Cya Cly Phe Leu Leu Trp 
130 135 140 



432 



TCC ATG CCC CCC ACC CCT TCT AAT CGG CCT GAA CTG CTC TAC AAC CCC 
Cya Met Ala Pro Ser Pro Ser Asn Gly Ala Glu Leu Leu Tyr Lye Arg 
145 150 155 160 



480 



ATC ATC CCT CCT TTC TTC CTC AAG CAC GAG TCC CAG ATC GAC ACT CTC 
He He Arg Pro Phe Phe Leu Lya Hia Glu Ser Gin Met Aap Ser Val 
165 170 175 



528 



CTC AAG GAC CTT AAA CAC AAG TCC AAA GAG ACT CCA GAT CCC ATC ACT 
Val Lys Aap Leu Lya Aap Lys Ser Lya Glu Thr Ala Aap Ala lie Thr 
180 185 190 



576 



AAA GAA GCG AAG AAA CCT ACC CTG AAT TTA CTC CCT GAA GAA AAG AAG 
Lys Glu Ala Lys Lys Ala Thr Val Asn Leu Leu Gly Glu Glu Lys Lys 
195 200 205 



624 



ACC ACC TAAACCAGAC TAAACCACAC TCGATGGAAA CTTCCTGCCC TCTCTGTACC 
Ser Thr 
210 



680 



TTCCTACTGG 


ACCTTCATCT 


TATATTACGG 


ACTCTCGTAT 


AATTATTTTA 


ATAATCTTCC 


740 


CTTGGAAACA 


TTTTTGAGAT 


ATTAAACATT 


GGAATGTGTT 


GTAAGTTTCT 


TTGCTTACTT 


800 


TTACTGTCTA 


TATATATAGG 


GAGCACTTTA 


AACTTAATGC 


AGTGGGCAGT 


GTCCACGTTT 


860 


TTCGAAAATC 


TATTTTGCCT 


CTGGGTAGGA 


AAAGATGTAT 


GTTGCTATCC 


TGCACGAAAT 


920 


ATAAACTTAA 


AATAAAATTA 


TATACCCCAC 


ACCCTGTCTA 


CTTTACTGGG 


CTCTCCCTGC 


980 


ACCSATTTTC 


TCTGTAGTTA 


CATTTAGGRT 


AATCTTTATG 


GTTCTACTTC 


CTRTAATCTA 


1040 


CAATTTTATA 


TAATTCNCRA 


ATCTTTTTAA 


TCTATTTGTG 


CACATGTACA 


TATGGAAATG 


1100 


TTACTGTCTG 


ACTACANCAT 


GCATCATGCT 


CATGGGGAGG 


GAGCAGGGGA 


AGCTTCTATG 


1160 


TGTCATTTAT 


AACTTCTCTA 


CAGTAAGACC 


ACCTGCCAAA 


ACCTCGAGGA 


ACCATTGTGC 


1220 


TCGTCTGCTC 


TACTAAATAA 


TACTTTACCA 


AATACCTGAT 


TAATATGCAA 


GTGAACAAAG 


1280 


TGAGAAATCA 


AATCGAATGG 


AGATTGGCCT 


GGTTCTTTCC 


CTAGTATATG 


GCATATGAAT 


1340 


ACCAGCATAC 


CTTTATAAAG 


CACTTACTTA 


CTTAGTTACT 


CACTCTAGTC 


ATAAATCCGC 


1400 


AAATTTACAC 


ACACACACAC 


ACACACACAC 


ACACACACAC 


ACACACACAC 


ACACACACAC 


1460 
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ACTACCCTCT 


AACTCTCAAT 


TCCCTGAAAA 


ACTAGTAATA 


CTGTCTTATC 


TGCTATAAAC 


1S20 


TTTACATATT 


TGTCTATTGT 


CAAGATCCTA 


CANTGGAMNC 


CATTTCTGGT 


TTTATCTTCA 


1580 


NACSGGACAN 


ACATGTTGAT 


TTAGTCTTCT 


TTCCCAATCT 


TCTTTTTTAA 


MCCAGTTTNA 


1640 


CGMNCTTCTC 


RAGATTTGYC 


CACCTCTGAT 


TACATGTATG 


TTCTYCTTTG 


TATCATKACC 


1700 


AACAACATGC 


TAATGRCGAC 


ACCTAGCTCT 


RAGMGCAATT 


CTCGGAGANT 


GARACGNWCT 


1760 


ATARAGTMNC 


CCATAATCTG 


CTTGGCAATA 


GTTAAGTCAA 


TCTATCTTCA 


GTTTTTCTCT 


1820 


CGCCTTTAAG 


GTCAAACACA 


AGAGGCTTCC 


CTAGTTTACA 


AGTCAGAGTC 


ACTTCTAGTC 


1880 


CATTTAAATG 


CCCTCATCCC 


TATTCTTTGT 


GTTGATAAGC 


TGCACAKGAC 


TACATAGTAA 


1940 


GTACAGANCA 


GTAAAGTTAA 


MNCCGATGTC 


TCCATTCATC 


TGCCAANTCG 


NTATAGAGAG 


2000 


CAATTTCTCT 


GGACTAGAAA 


ATCTGAGTTT 


TACACCATAC 


TGTTAAGAGT 


CCTTTTGAAT 


2060 


TAAACTAGAC 


TAAAACAAGT 


CTATAACTAA 


ACTAACAAGA 


TTAAATATCC 


AGCCAGTACA 


2120 


CTATTTTTTA 


AGGCAAATAA 


AGATGATTAG 


CTCACCTTGA 


GNTAACAATC 


AGGTAAGATC 


2180 


ATNACAAXGT 


CTCATGATCT 


KAANAATATT 


AAAGATATCA 


ATACTAAGTG 


ACAGTATCAC 


2240 


NNCTAATATA 


ATATGGATCA 


CAGCATTTAT 


TTTGGGGAGG 


AAAACAGTGG 


TGATTACCGG 


2300 


CATTTTATTA 


AACTTAAAAC 


TTTGTAGAAA 


GCAAACAAAA 


TTGTTCTTGG 


GAGAAAATCA 


2360 


ACTTTTACAT 


TAAAAAAATT 


TTAAGTAWCT 


AGGAGTATTT 


AAATCCTTTT 


CCCATAAATA 


2420 


AAAGTACAGT 


TTTCTTGGTG 


GCAGAATGAA 


AATCAGCAAC 


NTCTAGCATA 


TAGACTATAT 


2480 


AATCACATTG 


ACAGCATATA 


GAATATATTA 


TCAGACAAGA 


TGAGGAGGTA 


CAAAAGTTAC 


2540 


TATTCCTCAT 


AATGACTTAC 


AGGCTAAAAN 


TAGNTNTAAA 


ATACTATATT 


AAATTCTGAA 


2600 


TCCAATTTTT 


TTTTGTTCCC 


TXGAGACCAA 


AATTTAAGTT 


AACTGTTGCT 


GGCAGTCTAA 


2660 


GTGTAAATGT 


TAACAGCAGG 


AGAAGTTAAG 


AATTGAGCAG 


TTCTGTTGCA 


TGATTTCCCA 


2720 


AATGAAATAC 


TGCCTTCCCT 


AGAGTTTGAA 


AAACTAATTG 


AGCCTGTGCC 


TGGCTAGAAA 


2780 


ACAACCCTTT 


ATTTGAATGT 


GAATAGTGTT 


TCAAAGGTAT 


GTAGTTACAG 


AATTCCTACC 


2840 


AAACAGCTTA 


AATTCTTCAA 


GAAAGAATTC 


CTCCAGCAGT 


TATTCCCTTA 


CCTGAAGGCT 


2900 


TCAATCATTT 


GGATCAACAA 


CTGCTACTCT 


CGGGAAGACT 


CCTCTACTCA 


CAGCTGAAGA 


2960 


AAATGAGCAC 


ACCCTTCACA 


CTGTTATCAC 


CTATCCTGAA 


GATGTGATAC 


ACTGAATGGA 


3020 


AATAAATAGA 


TGTAAATAAA 


ATTCAGWTCT 


CATTTAAAAA 


AAACCATGTG 


CCCAATGGGA 


3080 


AAATGACCTC 


ATGTTGTGGT 


TTAAACAGCA 


ACTGCACCCA 


CTAGCACAGC 


CCATTCAGCT 


3140 


ANCCTATATA 


TACATCTCTG 


TCAGTGCCCC 


TC 






3172 



(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQaENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Val Ala Ala Pro Val Tyr Pro Ala Leu Cly Thr Ala Pro Cly Gly 
15 10 15 

Glu Thr Val Pro Ala Met Ser Ala Ala Met Arg Glu Arg Phe Aap Ara 
20 25 30 

Phe Leu His Glu Lys Asn Cys Met Thr Asp Leu Leu Ala Lys Leu Glu 
35 40 45 

Ala Lys Thr Gly Val Aan Arg Ser Phe lie Ala Leu Gly Val lie Cly 
50 55 60 

Leu Val Ala Leu Tyr Leu Val Phe Cly Tyr Gly Ala Ser Leu Leu Cys 
65 70 75 80 

Asn Leu lie Cly Phe Gly Tyr Pro Ala Tyr lie Ser lie Lys Ala lie 
85 90 95 

Glu Ser Pro Asn Lys Glu Asp Asp Thr Gin Trp Leu Thr Tyr Trp Val 
100 105 110 

Val Tyr Cly Val Phe Ser lie Ala Glu Phe Phe Ser Asp lie Phe Leu 
115 120 125 

Ser Trp Phe Pro Phe Tyr Tyr Met Leu Lys Cys Gly Phe Leu Leu Trp 
130 135 140 

Cys Met Ala Pro Ser Pro Ser Asn Gly Ala Glu Leu Leu Tyr Lys Arg 
145 150 155 160 

lie lie Arg Pro Phe Phe Leu Lys His Glu Ser Gin Met Asp Ser Val 
165 170 175 

Val Lys Asp Leu Lys Asp Lys Ser Lys Glu Thr Ala Asp Ala lie Thr 
180 185 190 

Lys Glu Ala Lys Lys Ala Thr Val Asn Leu Leu Gly Glu Glu Lys Lys 
195 200 205 

Ser Thr- 
210 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(vi) ORIGINAL SOURCE: 

(A) okCANISH: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLOWE: TBI 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Val Ala Pro Val Val Val Cly Ser Cly Arg Ala Pro Arg His Pro Ala 
15 10 15 

Pro Ala Ala Mot Hia Pro Arg Arg Pro Asp Cly Pho Asp Gly Leu Gly 
20 25 30 

Tyr Arg Gly Gly Ala Arg Asp Glu Gin Gly Phe Gly Gly Ala Phe Pro 
35 40 45 

Ala Arg Ser Phe Ser Thr Gly Ser Asp Leu Gly His Trp Val Thr Thr 
SO 55 60 

Pro Pro Asp lie Pro Gly Ser Arg Asn Leu His Trp Gly Glu Lys Ser 
65 70 75 60 

Pro Pro Tyr Gly Val Pro Thr Thr Ser Thr Pro Tyr Glu Gly Pro Thr 
85 90 95 

Glu Glu Pro Phe Ser Ser Gly Gly Gly Gly Ser Val Gin Gly Gin Ser 
100 105 110 

Ser Glu Gin Leu Asn Arg Phe Ala Gly Phe Gly lie Gly Leu Ala Ser 
115 120 125 

Leu Phe Thr Glu Asn Val Leu Ala His Pro Cys lie Val Leu Arg Arg 
130 135 140 

Gin Cys Gin Val Asn Tyr His Ala Gin His Tyr His Leu Thr Pro Phe 
145 150 155 160 

Thr Val lie Asn lie Met Tyr Ser Phe Asn Lys Thr Gin Gly Pro Arg 
165 170 175 

Ala Leu Trp Lys Gly Met Gly Ser Thr Phe lie Val Gin Gly Val Thr 
180 185 190 

Leu Gly Ala Glu Gly lie lie Ser Glu Phe Thr Pro Leu Pro Arg Glu 
195 200 205 

Val Leu His Lys Trp Ser Pro Lys Gin lie Gly Glu His Leu Leu Leu 
210 215 220 

Lys Ser Leu Thr Tyr Val Val Ala Met Pro Phe Tyr Ser Ala Ser Leu 
225 230 235 240 

lie Glu Thr Val Gin Ser Glu lie lie Arg Asp Asn Thr Gly lie Leu 
245 250 255 

Glu Cys Val Lys Glu Gly lie Cly Arg Val lie Gly Met Cly Val Pro 
260 265 270 
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Hls Sec Lye Krg Lau L«u Pro L«u L«u Smr Leu He Pha Pro Thr Val 
275 280 285 

Leu His Cly Val Leu His Tyr He lie Ser Ser Val He Gin Lys Phe 
290 295 300 

Val Leu Leu lie Leu Lys Krg Lys Thr Tyr Asn Ser His Leu Ala Glu 

305 310 315 320 

Ser Thr Ser Pro Val Gin Ser Met Leu Asp Ala Tyr Phe Pro Glu Leu 
325 330 335 

He Ala Asn Phe Ala Ala Ser Leu Cys Ser Asp Val He Leu Tyr Pro 
340 345 350 

Leu Glu Thr Val Leu His Arg Leu His He Gin Gly Thr Arg Thr He 
355 360 365 

He Asp Asn Thr Asp Leu Gly Tyr Glu Val Leu Pro He Asn Thr Gin 

370 375 380 

Tyr Glu Gly Met Arg Asp Cys He Asn Thr He Arg Gin Glu Glu Gly 
385 390 395 400 

Val Phe Gly Phe Tyr Lys Gly Phe Gly Ala Val He He Gin Tyr Thr 
405 410 415 

Leu His Ala Ala Val Leu Gin He Thr Lys He He Tyr Ser Thr Leu 
420 425 430 

Leu Gin 



(2) INFORMATION FOR SEQ ID NO:6i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(8) CLONE: YS-39<TB2) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Glu Leu Arg Arg Phe Asp Arg Phe Leu His Glu Lys Asn Cys Met Thr 
15 10 15 

Asp Leu Leu Ala Lys Leu Glu Ala Lys Thr Gly Val Asn Arg Ser Phe 
20 25 30 

He Ala Leu Gly Val He Gly Leu Val Ala Leu Tyr Leu Val Phe Cly 
35 40 45 
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Tvr Cly Ala Ser Leu Leu Cy» Kmn Leu lie Cly Phe Cly Tyr Pro Ala 

^ 50 55 60 

Tvr lie Ser lie Lys Ala He Clu Ser Pro Aan Lya Clu Aap Aap Thr 

65 70 75 80 

Gin Trp Leu Thr Tyr Trp Val Val Tyr Cly Val Phe Ser He Ala Clu 
85 90 95 

Phe Phe Ser Aap He Phe Leu Ser Trp Phe Pro Phe Tyr Tyr He Leu 
100 105 110 

LVB CvB Cly Phe Leu Leu Trp Cys Met Ala Pro Ser Pro Ser Asn Cly 
^ ^ 115 120 125 

Ala Clu Leu Leu Tyr Lya Arg He He Arg Pro Phe Phe Leu Lya Hia 
130 135 140 

Clu Ser Cln Met Aap Ser Val Val Lys Aap Leu Lys Aap Lys Ala Lys 
145 150 155 160 

Clu Thr Ala Aap Ala He Thr Lya Clu Ala Lya Lya Ala Thr Val Aan 
165 170 175 

Leu Leu Cly Clu Clu Lya Lya Ser Thr 
180 185 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2842 amino acida 

(B) TYPE: amino acid 

(C) STRANDEONESS : a ingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo aapiena 

<vii) IMMEDIATE SOURCE: 
(B) CLONE: APC 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Ala Ala Ala Ser Tyr Asp Cln Leu Leu Lya Cln Val Clu Ala Leu 
15 10 15 

Lya Met Clu Aan Ser Aan Leu Arg Cln Clu Leu Clu Aap Aan Ser Aan 
20 25 30 

Hia Leu Thr Lya Leu Clu Thr Clu Ala Ser Aan Met Lya Clu Val Leu 
35 40 45 

Lys Gin Leu Cln Cly Ser He Clu Aap Clu Ala Met Ala Ser Ser Cly 
50 55 60 

Gin He Asp Leu Leu Clu Arg Leu Lya Clu Leu Asn Leu Asp Ser Ser 
65 70 75 80 
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Aan Phe Pro Gly Val Lye Lau Arg Smr Lys Met S«r L«u Arg Ser Tyr 

85 90 95 

Gly Ser Arg Glu Gly Smr Val Ser Ser Arg Ser Gly Glu Cys Ser Pro 
100 105 110 

Val Pro Met Gly Ser Phe Pro Arg Arg Gly Phe Val Aen Gly Ser Arg 
lis 120 125 

Glu Ser Thr Gly Tyr Leu Glu Glu Leu Glu Lys Glu Arg Ser Leu Leu 
130 135 140 

Leu Ala Aap Leu Asp Lye Glu Glu Lya Glu Lys Asp Trp Tyr Tyr Ala 
145 150 155 160 

Gin Leu Gin Aan Leu Thr Lya Arg lie Aap Ser Leu Leu Thr Glu Aan 

165 170 175 

Phe Ser Leu Gin Thr Aap Met Thr Arg Arg Gin Leu Glu Tyr Glu Ala 
180 Ids 190 

Arg Gin lie Arg Val Ala Met Glu Glu Gin Leu Gly Thr Cya Gin Aap 
195 200 205 

Met Glu Lys Arg Ala Gin Arg Arg lie Ala Arg He Gin Gin He Glu 
210 215 220 

Lys Asp He Leu Arg He Arg Gin Leu Leu Gin Ser Gin Ala Thr Glu 
225 230 235 240 

Ala Glu Arg Ser Ser Gin Asn Lys His Glu Thr Gly Ser His Asp Ala 

245 250 255 

Glu Arg Gin Asn Glu Gly Gin Gly Val Gly Glu He Asn Met Ala Thr 
260 265 270 

Ser Gly Aan Gly Gin Gly Ser Thr Thr Arg Met Asp His Glu Thr Ala 
275 280 285 

Ser Val Leu Ser Ser Ser Ser Thr His Ser Ala Pro Arg Arg Leu Thr 
290 295 300 

Ser Kia Leu Gly Thr Lya Val Glu Met Val Tyr Ser Leu Leu Ser Met 

305 310 315 320 

Leu Gly Thr Hie Aap Lys Asp Asp Met Ser Arg Thr Leu Leu Ala Met 

325 330 335 

Ser Ser Ser Gin Aap Ser Cys He Ser Met Arg Gin Ser Gly Cys Leu 
340 345 350 

Pro Leu Leu He Gin Leu Leu His Gly Asn Asp Lys Asp Ser Val Leu 

355 360 365 

Leu Gly Asn Ser Arg Gly Ser Lys Glu Ala Arg Ala Arg Ala Ser Ala 
370 375 380 

Ala Leu His Asn He He His Ser Gin Pro Asp Asp Lys Arg Gly Arg 
385 390 395 400 

Arg Glu He Arg Val Leu His Leu Leu Glu Gin He Arg Ala Tyr Cye 
405 410 415 
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Glu Thr Cy« Trp Glu Trp Gin Glu Al* HLm Glu Pro Gly Met Aap Cln 
^ 420 425 430 

Aap Lye Aen Pro Met Pro Ala Pro Val Glu Hia Gin He Cya Pro Ala 
435 440 445 

val CV8 Val Leu Met Ly« Leu Ser Phe Aap Glu Glu His Arg His Ala 
450 455 460 

Met Asn Glu Leu Gly Gly Leu Gin Ala He Ala Glu Leu Leu Gin Val 
465 470 475 480 

ABp Cya Glu Met Tyr Gly Leu Thr Aan Aap Hia Tyr Ser He Thr Leu 
^ ^ 485 490 495 

Arc Ara Tvr Ala Gly Met Ala Leu Thr Aan Leu Thr Phe Gly Aap Val 
500 505 510 

Ala Aan Lya Ala Thr Leu Cya Ser Met Lya Gly Cys Met Arg Ala Leu 
515 520 525 

Val Ala Gin Leu Lya Ser Glu Ser Glu Aap Leu Gin Gin Val He Ala 
530 535 540 

ser Val Leu Arg Aan Leu Ser Trp Arg Ala Aap Val Aan Ser Lya Lya 



545 



550 555 560 

Thr Leu Arg Glu Val Gly Ser Val Lya Ala Leu Met Glu Cya Ala Lau 

565 570 575 

Glu Val Lys Lya Glu Ser Thr Leu Lya Ser Val Leu Ser Ala Leu Trp 
580 585 590 

Aan Leu Ser Ala Hia Cya Thr Glu Aan Lya Ala Aap He Cya Ala Val 

595 600 605 

Aao Gly Ala Leu Ala Phe Leu Val Gly Thr Leu Thr Tyr Arg Ser Gin 
^ 610 615 520 

Thr Aan Thr Leu Ala He He Glu Ser Gly Gly Gly He Leu Arg Aan 
625 630 635 640 

Val Ser Ser Leu He Ala Thr Aan Glu Aap Hia Arg Cln He Leu Arg 
645 650 655 

Glu Asn Asn Cya Leu Gin Thr Leu Leu Cln Hia Leu Lya Ser Hia Ser 

660 665 670 

Leu Thr He Val Ser Aan Ala Cya Gly Thr Leu Trp Aan Leu Ser Ala 

675 680 685 

Ara Aen Pro Lya Aap Gin Glu Ala Leu Trp Aap Met Gly Ala Val Ser 

690 695 700 

Met Leu Lya Aan Leu He Hia Ser Lya Hia Lya Met He Ala Met Gly 
705 710 715 720 

Ser Ala Ala Ala Leu Arg Aan Leu Met Ala Aan Arg Pro Ala Lya Tyr 
725 730 735 

Lya Aap Ala Aan He Met Ser Pro Gly Ser Ser Leu Pro Ser Leu Hia 
^ ^ 740 745 750 
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Val Ara Lys Gin Lys Ala L«u Glu Ala Glu Lau Asp Ala Gin His Lau 

755 760 765 

Ser Glu Thr Phe Asp Aan lie Aap Aan Lau Sor Pro Lya Ala Sar Hia 

770 775 780 

Arg Ser Lys Cln Arg Hia Lya Gin Sar I*au Tyr Cly Aap Tyr Val Phe 
785 790 795 800 

Aep Thr Aan Arg Hia Aap Aap Aan Arg Sar Aap Aan Phe Aan Thr Cly 
805 810 815 

Aan Met Thr Val Lau Sar Pro Tyr Lau Aan Thr Thr Val Lau Pro Sar 
820 825 830 

Ser Sar Ser Ser Arg Gly Ser Leu Aap Sar Ser Arg Ser Glu Lya Aap 
835 840 845 

Arg Ser Leu Glu Arg Glu Arg Gly lie Gly Leu Gly Aan Tyr His Pro 
850 855 860 

Ala Thr Glu Aan Pro Gly Thr Ser Ser Lya Arg Gly Leu Gin lie Ser 
865 870 875 880 

Thr Thr Ala Ala Gin lie Ala Lya Val Met Glu Glu Val Sar Ala He 
885 890 895 

His Thr Ser Gin Glu Aap Arg Ser Ser Gly Ser Thr Thr Glu Leu His 
900 905 910 

Cya Val Thr Asp Glu Arg Asn Ala Lau Arg Arg Ser Ser Ala Ala Hia 
915 920 925 

Thr Hia Ser Aan Thr Tyr Aan Phe Thr Lya Ser Glu Aan Ser Aan Arg 

930 935 940 

Thr eye Ser Met Pro Tyr Ala Lya Lau Glu Tyr Lys Arg Ser Ser Asn 
945 950 955 960 

Aap Ser Leu Asn Ser Val Ser Ser Ser Aap Gly Tyr Gly Lya Arg Gly 
965 970 975 

Gin Met Lys Pro Ser He Glu Ser Tyr Ser Glu Asp Asp Glu Ser Lys 
980 985 990 

Phe Cys Ser Tyr Gly Cln Tyr Pro Ala Aap Lau Ala Hia Lya lie His 
995 1000 1005 

Ser Ala Asn His Met Aap Asp Asn Asp Gly Glu Leu Asp Thr Pro He 

1010 1015 1020 

Asn Tyr Ser Leu Lys Tyr Ser Asp Glu Gin Leu Asn Ser Gly Arg Cln 
1025 1030 1035 1040 

Ser Pro Ser Gin Asn Glu Arg Trp Ala Arg Pro Lys His He He Glu 
1045 1050 1055 

Asp Glu He Lys Gin Ser Glu Cln Arg Gin Ser Arg Asn Gin Ser Thr 
1060 1065 1070 

Thr Tyr Pro Val Tyr Thr Glu Ser Thr Aap Aap Lys His Leu Lys Phe 
1075 1080 1085 
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Cln pro Hi. Phe Cly Gin Gin Clu Cy. V.l S« Pro Tyr Arg Smr Atg 
1090 1095 llO" 

Cly Al« A.n Cly Sar Glu Thr A.n Arg V.l Cly Sar Aan HI. Cly Ila^ 

1105 1110 11X3 

A.n Cln A.n v.l Sar Cln Ser Leu Cy. Cln Clu A.p A.p Tyr Clu A.p 

112S 1130 

A.p Ly. pro Thr A.n Tyr Ser Clu Arg Tyr Ser Clu Clu Clu Cln Hi. 
H40 1145 XJ.su 

Clu Clu Glu Clu Arg Pro Thr A.n Tyr Sar lie Ly. Tyr A.n Clu Clu 
1155 1160 

Ly. Arg Hi. V.l A.p Gin Pro lie A.p Tyr Ser Lau Ly. Tyr Ala Thr 

1170 "■'S IIBO 

A.p lie pro ser Ser Gin Ly. Cln Ser Phe Ser Phe Ser Ly. Ser Ser 
1185 1190 1195 1200 

sar Cly Cln sar Sar Ly. Thr Clu Hi. Met Sar Ser Ser Ser Clu A.n 

1205 1210 1^15 

Thr ser Thr Pro Ser Ser A.n Ala Ly. Arg Cln A.n Cln Lau Hi. Pro 
1220 1225 i^Ju 

sar sar Ala Cln Ser Arg Sar Cly Cln Pro Cln Ly. Ala Ala Thr Cy. 
1235 1240 1245 

Lv. Val Sar Ser lie A.n Cln Clu Thr lie Cln Jhr Tyr Cy. Val Clu 
^ 1250 1255 1260 

A.p Thr pro lie Cy. Phe Sar Arg Cy. Sar sar Lau Sar Sar Leu Ser 

1265 1270 1275 1280 

ser Ala Clu A.p Clu lie Cly Cya A.n Cln Thr Thr Cln Clu Ala A.p 
1285 1290 1295 

ser Ala A.n Thr Lau Cln He Ala Clu lie Ly. Clu Ly. ll* Cly Thr 
1300 1305 1310 

Arq ser Ala Clu A.p Pro Val Ser Clu Val Pro Ala Val Sar Cln Hi. 
^ 1315 1320 1325 

Pro Arg Thr Ly. Sar Sar Arg Lau Cln Cly Ser far Lau Ser Sar Clu 
1330 1335 1340 

ser Ala Arg Hi. Ly. Ala Val Clu Phe Sar fer Cly Ala Ly. Ser Pro 
1345 13SO 1355 

ser Ly. Sar Cly Ala Cln Thr Pro Ly. Sar Pro Pro Clu Hi. Tyr Val 
1365 1370 

Cln Clu Thr Pro Lau Met Phe Ser Arg Cy. Thr Ser Val Ser Sar Lau 

1380 1385 iJ^u 

Asp ser Phe Clu Ser Arg Ser He Ala Ser Ser Val Cln Ser Clu Pro 
*^ 5^395 1400 1405 

cys ser Cly Met Val Ser Cly^He He Ser Pro Ser^Asp Leu Pro A.p 
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Ser Pro Cly Gin Thr Met Pro Pro Smr Arg Smr Lym Thr Pro Pro Pro 
1425 1430 143S 1440 

Pro Pro Gin Thr Ala Gin Thr Lys Arg Glu Val Pro Lys A«n Lys Ala 
1445 1450 1455 

Pro Thr Ala Glu Lys Arg Glu Ser Gly Pro Lys Gin Ala Ala Val Asn 
1460 1465 1470 

Ala Ala Val Gin Arg Val Gin Val Leu Pro Asp Ala Aap Thr Leu Leu 
1475 1480 1485 

Hia Phe Ala Thr Glu Ser Thr Pro Asp Gly Phe Ser Cys Ser Ser Ser 
1490 1495 1500 

Leu Ser Ala Leu Ser Leu Asp Glu Pro Phe lie Gin Lys Asp Val Glu 
1505 1510 1515 1520 

Leu Arg lie Met Pro Pro Val Gin Glu Asn Asp Asn Gly Asn Glu Thr 
1525 1530 1535 

Glu Ser Glu Gin Pro Lys Glu Ser Asn Glu Asn Gin Glu Lys Glu Ala 
1540 1545 1550 

Glu Lys Thr lie Asp Ser Glu Lys Asp Leu Leu .Asp Asp Ser Asp Asp 

1555 1560 1565 

Asp Asp lie Glu lie Leu Glu Glu Cys He lie Ser Ala Met Pro Thr 
1570 1575 1580 

Lys Ser Ser Arg Lys Ala Lys Lys Pro Ala Gin Thr Ala Ser Lys Leu 
1585 1590 1595 1600 

Pro Pro Pro Val Ala Arg Lys Pro Ser Gin Leu Pro Val Tyr Lys Leu 
1605 1610 1615 

Leu Pro Ser Gin Asn Arg Leu Gin Pro Gin Lys His Val Ser Phe Thr 
1620 1625 1630 

Pro Gly Asp Asp Met Pro Arg Val Tyr Cys Val Glu Gly Thr Pro lie 
1635 1640 1645 

Asn Phe Ser Thr Ala Thr Ser Leu Ser Asp Leu Thr He Glu Ser Pro 
1650 1655 1660 

Pro Asn Glu Leu Ala Ala Gly Glu Gly Val Arg Gly Gly Ala Gin Ser 
1665 1670 1675 1680 

Gly Glu Phe Glu Lys Arg Asp Thr He Pro Thr Glu Cly Arg Ser Thr 
1685 1690 1695 

Asp Glu Ala Gin Gly Gly Lys Thr Ser Ser Val Thr He Pro Glu Leu 
1700 1705 1710 

Asp Aep Asn Lys Ala Glu Glu Gly Asp He Leu Ala Glu Cys He Asn 
1715 1720 1725 

Ser Ala Met Pro Lys Cly Lys Ser His Lys Pro Phe Arg Val Lys Lys 
1730 1735 1740 

He Met Asp Gin Val Gin Glh Ala Ser Ala Ser Ser Ser Ala Pro Asn 
1745 1750 1755 1760 
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Lya Aan Cln Lau A-PgC^Y ^7*0^'° 17^5^^' 

Pro He pro Cln Aan Thr Glu Tyr Arg Thr Arg Val Arg Lya Aan Ala 

1780 1785 1790 

Aap Ser Lya Aan Aan Leu Aan JJj^^^^^" '^^ fsos'^*^ 



1795 



Aap ser Lya Lya Cln Aan Leu Lya Aan Aan Ser Lya Aap Phe Aan Aap 

1810 1815 1820 

Lya Leu Pro Aan Aan Clu^Aap Arg Val Arg Cly^Ser Phe Ala Phe Aap^ 

ser Pro Hia Hie Tyr Thr Pro He Clu Cly Thr Pro Tyr Cya f he Ser 
1845 1850 1855 

Arg Aan Aap Ser Leu Ser Ser Leu Aap Phe Aap Aap Aap Aap Val Aap 

I860 1865 1870 

Leu ser Arg Clu Lya Ala Clu Leu Arg Lya Ala Lya Clu Aan Lya Clu 
1875 1880 1885 

Smn Clu Ala Lya Val Thr Ser Hia Thr Clu Leu Thr Ser Aan Cln Cln 

1890 1895 1900 

ser Ala Aan Lya Thr Cln Ala He Ala Lya Cln Pro He Aan Arg Cly 
190S 1910 1915 1920 

Gin Pro Lya Pro He Leu Cln Lye Cln Ser Thr Phe Pro Cln Ser ser 
1925 1930 1935 

Lye Aap He Pro Aap Arg Cly Ala Ala Thr Aap Clu Lya teu Cln Aan 

1940 1945 1950 

Phe Ala He Clu Aan Thr Pro Val Cya Phe Ser Hia Aan Ser Ser Leu 

1955 1960 1965 

ser ser Leu Ser Aap He Aap Cln Clu Aan Aan Aan Lya Clu Aan Clu 
1970 1975 1980 

Pro He Lya Clu Thr Clu^Pro Pro Aap Ser CJ^g^^^y J^Jq 

pro Gin Ala Ser Cly Tyr Ala Pro Lya Ser Phe Hie Val Clu Aap Thr 

2005 2010 2015 

Pro Val Cya Phe^Ser Arg Aan Ser S^l^^" 2030^^" 

ser Clu Aap Aap Leu Leu Cln Clu Cya He Ser Ser Ala Met Pro Lya 
2035 2040 2045 

Lye Lys Lya Pro Ser Arg Leu Lya Cly Aap Aan Clu Lya Hia Ser Pro 

^ 2050 2055 2060 

Arg Aan Met Cly Cly He Leu Cly Clu Aap Leu Thr Leu Aap Leu Lya 
2065 2070 2075 2QBO 

Aap He Cln Arg Pro Aap Ser Clu Hia Cly Leu Ser Pro Aap Ser Clu 

2085 2090 ^u^D 
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Aan Pho A«p Trp Ly« Al* Ilo Cln Clu Cly Al« A«n Smr llm Val Sor 

2100 2105 2110 

Ser Lau His Cln Ala XI* Ala Ala Ala Cya Lau Sar Arg Gin Ala Smr 

2115 2120 2125 

Ser Asp Ser Aap Sar lie Lau Sar Lau Lys Ser Cly lie Ser Leu Cly 
2130 2135 2140 

Ser Pro Phe His Leu Thr Pro Aap Cln Clu Clu Lys Pro Phe Thr Ser 
2145 2150 2155 2160 

Asn Lys Cly Pro Arg lie Lau Lya Pro Cly Clu Lys Sar Thr Leu Clu 
2165 2170 2175 

Thr Lys Lys lie Clu Ser Clu Ser Lya Cly He Lys Cly Cly Lys Lys 
2180 2185 2190 

Val Tyr Lys Ser Leu lie Thr Cly Lys Val Arg Ser Asn Ser Clu He 
2195 2200 2205 

Ser Cly Cln Met Lys Cln Pro Leu Gin Ala Asn Met Pro Ser lie Ser 
2210 2215 2220 

Arg Cly Arg Thr Mat He His He Pro Cly Val Arg Asn Sar Ser Ser 
2225 2230 2235 2240 

Ser Thr Ser Pro Val Ser Lys Lys Gly Pro Pro Leu Lys Thr Pro Ala 
2245 2250 2255 

Ser Lys Ser Pro Ser Clu Cly Cln Thr Ala Thr Thr Ser Pro Arg Gly 
2260 2265 2270 

Ala Lys Pro Sar Val Lys Ser Clu Lau Ser Pro Val Ala Arg Cln Thr 
2275 2280 2285 

Ser Gin He Cly Cly Ser Ser Lys Ala Pro Ser Arg Ser Cly Ser Arg 
2290 2295 2300 

Asp Ser Thr Pro Ser Arg Pro Ala Cln Gin Pro Leu Ser Arg Pro He 
2305 2310 2315 2320 

Gin Ser Pro Gly Arg Asn Ser He Ser Pro Cly Arg Asn Cly He Ser 
2325 2330 2335 

Pro Pro Asn Lys Leu Ser Cln Leu Pro Arg Thr Ser Ser Pro Ser Thr 
2340 2345 2350 

Ala Ser Thr Lys Ser Ser Cly Ser Gly Lys Met Ser Tyr Thr Ser Pro 
2355 2360 2365 

Cly Arg Cln Met Ser Cln Cln Asn Leu Thr Lys Gin Thr Cly Leu Ser 
2370 2375 2380 

Lys Asn Ala Ser Ser He Pro Arg Ser Clu Ser Ala Ser Lys Cly Leu 
2385 2390 2395 2400 

Asn Cln Met Asn Asn Cly Asn Cly Ala Asn Lys Lys Val Clu Leu Ser 
2405 2410 2415 

Arg Met Ser Ser Thr Lys Ser Ser Gly Ser Glu Ser Asp Arg Ser Clu 
2420 2425 2430 
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Ara Pro V«l Lmxi VaI Xrg Gin Ser Thr Phe lie Lya "u Al* Pro Ser 
2435 2440 2445 

Pro Thr Leu Arg Arg Lye Leu Clu Clu Ser Al* fer Phe Clu Ser Leu 
2450 24S5 2460 

ser Pro Ser Ser Arg Pro Ala Ser Pro Thr Arg Ser Gin Ala Gin Thr 
2465 2470 2475 2480 

Pro Val Leu Ser Pro Ser Leu Pro Aap Met Ser Leu Ser Thr Hie Ser 
2485 2490 2495 

ser Val Gin Ala Cly Cly Trp Arg Lye Leu Pro Pro Aen Leu Ser Pro 
2500 2505 2510 

Thr He Glu Tyr Aen A«p Gly Arg Pro Ala Lye Arg Hie Aep He Ala 
2515 2520 2525 

Ara Ser Hie Ser Glu Ser Pro Ser Arg Leu Pro He Aen Arg Ser Cly 
^ 2530 2535 2540 

Thr Trp Lye Arg Glu Hie Ser Lys Hie Ser Ser Ser Leu Pro Arg Val 
2545 2550 2555 2560 

ser Thr Trp Arg Arg Thr Gly Ser Ser Ser Ser He Leu Ser Ala Ser 
2565 2570 2575 

Ser Glu ser Ser Glu Lye Ala Lye Ser Clu Aep Clu Lys Hie Val Aen 
2580 2585 2590 

Ser He Ser Gly Thr Lye Gin Ser Lye Glu Aen Gin Val Ser Ala Lye 
2595 2600 2605 

Civ Thr Trp Arg Lye He Lye Glu Aen Glu Phe Ser Pro Thr Aen Ser 
^ 2610 2615 2620 

Thr ser Gin Thr Val Ser Ser Cly Ala Thr Aan Cly Ala Glu Ser Lye 
2625 2630 2635 2640 

Thr Leu He Tyr Gin Met Ala Pro Ala Val Ser Lye Thr Clu Aep Val 
2645 2650 2655 

Trp Val Arg He Clu Aep Cye Pro He Aen Aan Pro Arg Ser Gly Arg 
2660 2665 2670 

Ser Pro Thr Gly Aen Thr Pro Pro Val He Aep Ser Val Ser Clu Lye 
^ 2680 2685 



2675 



Ala Aen Pro Aen He Lye Aep Ser Lye Aep Aen Gin Ala Lye Gin Aen 

2690 2695 2700 

Val Gly Aen Gly Ser Val Pro Met Arg Thr Val Gly Leu Glu Aen Arg 
2705 2710 2715 2720 

Leu Aen Ser Phe He Gin Val Aep Ala Pro Aap Gin Lys Cly Thr Clu 
2725 2730 2735 

He LVB Pro Gly Gin Aen Asn Pro Val Pro Val Ser Glu Thr Aen Glu 
^ 2740 2745 2750 

Ser ser He Val Glu Arg Thr Pro^Phe Ser Ser Ser Ser^Ser Ser Lys 



2755 
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His S«r S«r Pro Sar Cly Thr Val Al* Al* Arg VaI Thr Pro Pho A«n 

2770 2775 2780 

Tvr Asn pro Sor Pro Arg Ly« Sar Sar Al* A«p Ser Thr Sar Ala Arg 

2785 2790 2795 2800 

Pro ser Gin He Pro Thr Pro Val Aan Aan Aen Thr Lya Lys Arg Aap 
2805 2810 2815 

ser Lya Thr Aap Sar Thr Clu Sar Sar Cly Thr Gin Sar Pro Lya Arg 
2820 2825 2830 

Hla Sar Gly Sar Tyr Lau Val Thr Sar Val 
2835 2840 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : aingla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

<B) CLONE: ral2(yea8t) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lau Thr Cly Ala Lya Cly Lau Gin Lau Arg Ala Leu Arg Arg lie Ala 

15 10 15 

Arg lie Glu Gin Gly Gly Thr Ala lie Ser Pro Thr Ser Pro Leu 
20 25 30 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: aingla 

(D) TOPOIXX2Y: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiena 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: m3(mAChR) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Leu Tvr Trp Arg lie Tyr Lya Clu Thr Glu Lya Arg Thr Lya Glu Leu 
1 5 10 15 
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XI* Cly L«u Cln Al* Smr Cly Thr Clu Al« Clu Thr Clu 



20 

(2) INFORMATION TOR SEQ ID NO: 10: 

I i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : aingl* 

(D) TOPOLOCY: linear 

(ii) MOLECULE TYPE: paptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiana 

<vii) IMMEDIATE SOURCE: 

(B) CLONE: MCC 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Leu Tyr Pro A.n Lau Ala Clu Clu Arg Sar Arg Trp Glu Lya Clu Lau 

1 5 10 . 

Ala Cly Lau Arg Clu Clu Aan Clu Sar Lau Thr Ala Mat 

20 25 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 baaa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOCY: linear 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

<X) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

40 

CTATCAAGAC TCTGACTTTT AATTCTAGTT TATCCATTTT 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
<B) TYPE: nuclaic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapxena 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTTAGAATTT CATCTTAATA TATTGTCTTC TTTTTAACAC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 baae pair* 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTAGATTTTA AAAAGGTGTT TTAAAATAAT TTTTTAACCT 40 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hofflo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AAGCAATTGT TGTATAAAAA CTTGTTTCTA TTTTATTTAG 40 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTAACTTTTC TTCATATAGT AAACATTGCC TTCTGTACTC 



40 
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(2) INFORMATION FOR SEQ ID NO: 16: 

lii SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 40 b««e pair* 

(B) TYPE: nucleic «cid 

Jc) STRAliOEONESS: • Ingle 

(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

fvi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo eapiena 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
HWINNNNNNN NHNGTCCCTT TTTTTAAAAA AAAAAAATAG 
(2) INFORMATION FOR SEQ ID NO: 17: 

ril SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 40 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: • ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17. 

40 

CTAACTAACT TGCCAGTACA ACTTATTTCA AACTTTAATA 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i> SEQUENCE CHARACTERISTICS J 

(A) LENGTH: 40 baee paira 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CONA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo eapiena 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

40 

ATACAAGATA TTCATACTTT TTTATTATTT GTGGTTTTAG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 baee pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii> MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTAAGTTACT TCTTTCTAAG TGATAAAACA GYGAAGAGCT 40 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bsstt pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
AATAAAAACA TAACTAATTA GGTTTCTTGT TTTATTTTAG 40 
(2) INFORMATION FOR SEQ ID NO: 21: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hofflo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GTTAGTAAAT TSCCTTTTTT CTTTGTGGGT ATAAAAATAG 40 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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40 



(xl) SEQUENCE DESCRIPTION! SEQ ID KO:22i 
ACCATTTTTG CATCTACTCA TGTTAACTCC ATCTTAACAC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bA«o pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: • ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GTAAATAAAT TATTTTATCA TATTTTTTAA AATTATTTAA 
(2) INFORMATION FOR SEQ ID MO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 baae pair a 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CATGATGTTA TCTGTATTTA CCTATAGTCT AAATTATACC ATCTATAATG TCCTTAATTT 

TTAC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GTAACAGAAC ATTACAAACC CTGGTCACTA ATCCCATGAC TACTTTGCTA AG 52 



60 
64 
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(2) INFORMATION FOR SEQ ID N05 26: 

<i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH! 46 base palra 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : •ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDHA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiena 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CGATATTAAA GTCGTAATTT TGTTTCTAAA CTCATTTCGC CCACAC 
(2> INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27j 

GTATCTTCTC TATAGTGTAC ATCGTAGTGC ATGTTTCAAA 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECX7LE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hocnc sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CATCATTGCT CTTCAAATAA CAAAGCATTA TGGTTTATCT TGATTTTATT TTTCAG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(li) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aaplen* 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTAACACAAA AATGTTTTTT AATCACATAC ACAATTACTC CTC 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 40 bA»e pairs 

(B) TYPE: nucl«ic acid 

(C) STRAHDEONESS: aingl^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiana 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TTAGATCATT GTCTTTTTCC TCTTGCCCTT TTTAAATTAG 
<2) INFORMATION FOR SEQ ID NO: 31: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 ba»« pairs 

(B) TYPE: nucl«ic acid 

(C) STRANDEDNESS : singls 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTATGTTTTT ATAACATCTA TTTCTTAAGA TAGCTCACGT ATCA 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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<xl) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CCTTGCCTTC AACTTCNCTT TTTAATCATC CTCTATTCTC TATTTAATTT ACAC 54 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTACTATTTA GAATTTCACC TGTTTTTCTT TTTTCTCTTT TTCTTTGAGG CAGGGTCTCA 60 

65 

CTCTG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY I linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCAACTAGTA TGATTTTATG TATAAATTAA TCTAAAATTG ATTAATTTCC AG 52 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CTACCTTTGA AAACATTTAG TACTATAATA TCAATTTCAT CT 42 



SUBSTITUTE SHEET 



wo 92/13103 



-92- 



PCT/US92/00376 



(2) INFORMATION FOR SEQ ID NO:36t 

<1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nuclaic acid 

(C) STRANOBDMESS: aingla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiena 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCAACTCNAA TTAGATCACC CATATTCACA AACTTACTAG 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 baae pairs 

(B) TYPE: nuclaic acid 

(C) STRANDE0NE5S : single 

(D) TOPOLOGY: linear 

(ii) MOLEOn-E TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CTATATATAC AGTTTTATAT TACTTTTAAA GTACAGAATT CATACTCTCA AAAA 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) 5TRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ATTCTGACCT TAATTTTGTC ATCTCTTGAT TTTTATTTCA G 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



suBSTrruTE sheet 



wo 92/13103 PCr/US92/00376 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo •api«n« 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TCCCCGCCTC CCCCTCTC 
(2) INFORMATION FOR SEQ ID NO: 40: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 basa palrfl 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: singla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GCAGCGGCGG CTCCCGTG 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 basa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY I linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
GTGAACGGCT CTCATGCTCC 20 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOx42: 
ACCTCCGCCC ACCAATCCA 
<2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 baae paira 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS : aingl« 

(D) TOPOLOGY: linaar 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiena 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ATGATATCTT ACCAAATCAT ATAC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 baae paira 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: aingla 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiena 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TTATTCCTAC TTCTTCTATA CAG 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 baae paira 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: a ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiena 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
TACCCATGCT GGCTCTTTTT C 



SUBSTITUTE SHPrT- 



' WO?>2/I3I03 



-95- 



PCr/US92/00376 



(2) INFORMATION FOR SEQ ID NO? 46: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : BLnglm 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: HofflO sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TGGGGCCATC TTCTTCCTGA 20 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
ACATTAGGCA CAAAGCTTGC AA 22 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(O) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
ATCAAGCTCC AGTAAGAAGG TA 22 
(2> INFORMATIOK FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo •apl«n« 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
TCCGGCTCCT CGGTTGTTG 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 b««« pairs 

(B) TYPE: nucl«ie acid 

(C) STRANDEONESS s • ingle 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiena 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCCCCTTCCT TTCTCAGGAC 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 baaa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEONESS: sing la 

(D) TOPOLOGY: linaar 

(ii) MOLECtJLE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TTTTCTCCTG CCTCTTACTC C 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 basa pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ATCACACCCC CCATTCCCTC 20 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: S3: 
CCACTTAAAG CACATATATT TACT 24 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GTATGGAAAA TAGTCAAGAA CC 22 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
TTCTTAAGTC CTGTTTTTCT TTTG 24 
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(2) INFORMATION FOR SEQ ID NO:56i 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 ba«« patra 

(B) TYPE: nucl«ic acid 

(C) STRAMOEDMESS: •tngla 
<D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TTTAGAACCT TTTTTGTCTT CTG 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CTCAGATTAT ACACTAAGCC TAAC 
(2) INFORMATION FOR SEQ ID NO: 58: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
CATGTCTCTT ACAGTAGTAC CA 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



SUBSTITUTE SHEET 



WO*^92/13103 



-99- 



(ii) MOLECULE TYPE: cOKA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
ACCTCCAAGG CTACCCAAGG 
(2) INFORMATION FOR SEQ ID NO: 60: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 
TAAAAATGGA TAAACTACAA TTAAAAC 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
AAATACAGAA TCATGTCTTC AACT 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
XCACCTAAAG ATGACAATTT GAG 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 baae pair« 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TAACTTAGAT ACCAGTAATT TCCC 
<2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
ACAATAAACT GGAGTACACA AGO 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
ATAGCTCATT GCTTCTTGCT CAT 
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(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 24 b««e pairs 

(B) TYPE: nuclaic «cid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hotno sapians 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
TGAATTTTAA TGGATTACCT AGGT 24 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: • ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CTTTTTTTGC TTTTACTGAT TAACG 25 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
TGTAATTCAT TTTATTCCTA ATACCTC 27 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cONA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CCTAGCCATA GTATCATTAT TTCT 
<2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 baae pairs 

(B) TYPE: nucl«ic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: lin«ar 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo aapienfl 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CTACCTATTT TTATACCCAC AAAC 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 
AAGAAAGCCT ACACCATTTT TGC 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CATCATTCTT ACAACCATCT TCC 
(2) INFORMATION TOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 ba«o pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
ACCTATAGTC TAAATTATAC CATC 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
CTCATGGCAT TAGTGACCAG 
(2) INFORMATION FOR SEQ ID NO: 75: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDMA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
AGTCGTAATT TTGTTTCTAA ACTC 
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(2) INFORMATION FOR SEQ ID NO: 76 s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bftM p*ir« 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: slngla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 
TGAACGACTC GCATTTCACG C 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
TCATTCACTC ACAGCCTGAT CAC 
(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECUM TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CCTTTGAAAC ATGCACTACG AT 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii> MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCES 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
AAACATCATT CCTCTTCAAA TAAC 2 4 

<2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECXn-E TYPE: cDNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TACCATCATT TAAAAATCCA CCAG 24 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GATGATTGTC TTTTTCCTCT TGC 23 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:82j 
CTCAGCTATC TTAACAAATA CATC 
(2) INFORMATION FOR SEQ ID NO: 83: 

<i) SEQCJENCE CHARACTERISTICS: 

(A) LENGTH: 25 bas* pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS : aingla 

(D) TOPOLOGY: linear 

(ii) KOLECCLE TYPE: CONA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo aapiana 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 
TTTTAAATCA TCCTCTATTC TGTAT 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 basa paira 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: singla 
(0) TOPOLOGY: linaar 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
ACAGAGTCAG ACCCTGCCTC AAAG 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 basa pairs 

(B) TYPE: nuclttic acid 

(C) STRANDEDNESS: singla 

(D) TOPOLOGY: Xinaar 

(ii) TOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
TTTCTATTCT TACTCCTAGC ATT 
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(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
ATACACAGGT AAGAAATTAG GA 22 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
TAGATGACCC ATATTCTGTT TC 22 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
CAATTAGGTC TTTTTGAGAG TA 22 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cONA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapl«n« 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
GTTACTGCAT ACACATTGTG AC 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bM« pAlr« 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

23 

GCTTTTTCTT TCCTAACATC AAG 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 

21 

TCTCCCACAG GTAATACTCC C 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOOTCE: 

(A) ORGANISM: Homo sapxens 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOt92x 
CCTACAACTC AATCCGCTAC G 21 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNCSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECUU TYPE: cDNA 

(vi) ORICINAI. SOURCE: 

(A) ORGANISM: HoffiO sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
CAGGACAAAA TAATCCTGTC CC 22 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ATTTTCTTAC TTTCATTCTT CCTC 24 
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CLAIMS 

1. A method of diagnosing or prognosing a neoplastic tissue 
of a human, comprising; 

detecting somatic alteration of wild-type APC gene cod- 
ing sequences or their expression products in a tumor tissue isolated 
from a human, said alteration indicating neoplasia of the tissue. 

2. The method of claim l wherein the expression products 
are mRNA molecules. 

3. The method of claim 2 wherein the alteration of 
wild-type APC mRNA is detected by hybridization of mRNA from said 
tissue to an APC gene probe. 

4. The method of claim 1 wherein alteration of wild-type 
APC gene coding sequences is detected by observing shifts in 
electrophoretic mobiMty of single-stranded DMA on non-denaturing 
polyacrylamide gels, 

5. The method of claim l wherein alteration of wild-type 
APC gene coding sequences is detected by hybridization of an APC 
gene coding sequence probe to genomic DNA Isolated from said tissue, 

6. The method of claim 5 further comprising: 

subjecting genomic DNA isolated from a non-neoplastic 
tissue of the human to Southern hybridization with the APC gene cod- 
ing sequence probe; and 

comparing the hybridizations of the APC gene probe to 
said tumor and non-neoplastic tissues. 

7. The method of claim 5 wherein the APC gene probe 
detects a restriction fragment length polymorphism. 

8. The method of claim l wherein the alteration of 
wild-type APC gene coding sequences is detected by determining the 
sequence of all or part of an APC gene in said tissue using a polymerase 
chain reaction, deviations in the APC sequence determined from that 
of the sequence shown In Figure 7 (SEQ ID NO.: 1) suggesting neoplasia. 

9. The method of claim 1 wherein the alteration of wild- 
type APC gene coding sequences is detected by identifying a mismatch 
between molecules (1) an APC gene or APC mRNA isolated from said 
tissue and (2) a nucleic acid probe complementary to the human wild- 
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type A PC gene coding sequence, when molecules (1) and (2) are hybrid- 
ized to each other to form a duplex. 

10. The method of claim 5 wherein the A PC gene probe 
hybridizes to an exon selected from the group consisting of: (1) 
nucleotides 822 to 930; and (2) nucleotides 931 to 1309; (3) nucleotides 
1406 to 1545; and (4) nucleotides 1956 to 2256, 

11. The method of claim 1 wherein the alteration of wild- 
type APC gene coding sequences is detected by amplification of A PC 
gene sequences in said tissue and hybridization of the amplified APC 
sequences to nucleic acid probes which comprise APC sequences. 

12. The method of claim 1 wherein the alteration of 
wild- type APC gene coding sequences is detected by molecular cloning 
of the APC genes in said tissue and sequencing all or part of the cloned 
APC gene. 

13. The method of claim 1 wherein the detection of alter- 
ation of wild-type APC gene coding sequences comprises screening for 
a deletion mutation. 

14. The method of claim 1 wherein the detection of alter- 
ation of wild- type APC gene coding sequences comprises screening for 
a point mutation. 

15* The method of claim 1 wherein the detection of alter- 
ation of wild-type APC gene coding sequences comprises screening for 
an insertion mutation. 

16. The method of claim 1 wherein the tumor tissue is a 
colorectal tissue. 

17. The method of claim 6 wherein the non-neoplastic tissue 
isolated from a human is from colonic mucosa. 

18. The method of claim 1 wherein the expression products 
are protein molecules. 

19. The method of claim 18 wherein the alteration of 
wild-type APC protein is detected by immunoblotting. 

20. The method of claim 18 wherein the alteration of 
wUd-type APC protein is detected by immunocytochemistry. 
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21. The method of claim 18 wherein the alteration of 
wild-type APC protein is detected by assaying for binding interactions 
between APC protein of said tumor tissue and a second cellular protein. 

22. The method of claim 21 wherein the second cellular pro- 
tein is selected from the group consisting of MCC protein, wild-type 
APC protein, and a G protein. 

23. The method of claim 18 wherein the alteration of 
wild-type APC protein Is detected by assaying for phospholipid 
metabolites. 

24. A method of supplying wild- type APC gene function to a 
cell which has lost said function by virtue of a mutation in an APC 
gene, comprising: 

Introducing a wild-type APC gene into a ceU which has 
lost said gene function such that said wild- type APC gene is expressed 
in the cell. 

25. The method of claim 24 wherein the wild-type APC gene 
introduced recombines with the endogenous mutant APC gene present 
in the cell by a double recombination event to correct the APC gene 
mutation. 

26. A method of supplying wild-type APC gene function to a 
cell which has altered APC function by virtue of a mutation in an APC 
gene, comprising: 

introducing a portion of a wild-type APC gene into a cell 
which has lost said gene function such that said portion Is expressed in 
the cell, said portion encoding a part of the APC protein which is 
required for non-neoplasfic growth of said cell. 

27. A method of supplying wild-type APC gene function to a 
cell which has altered APC function by virtue of a mutation in an APC 
gene, comprising: 

applying human wild- type APC protein to a cell which has 
lost wild-type APC function. 

28. A method of supplying wild-type APC gene function to a 
cell which has altered APC gene function by virtue of a mutation in an 
APC gene, comprising: 
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introducing Into the cell a molecule which mimics the 
function of wild-type APC protein. 

29. A pair of single stranded DNA primers for determination 
of a nucleotide sequence of an APC gene by polymerase chain reaction, 
the sequence of said primers being derived from chromosome 5q band 
21, wherein the use of said primers in a polymerase chain reaction 
results in synthesis of DNA having all or part of the sequence shown in 
Figure 7. 

30. The primers of claim 29 which have restriction enzyme 
sites at each 5' end. 

31. The pair of primers of claim 29 having sequences corre- 
sponding to APC Introns. 

32. A nucleic acid probe complementary to human wild-type 
APC gene coding sequences. 

33. The nucleic acid probe of claim 31 which hybridizes to an 
exon selected from the group consisting of: (l) nucleotides 822 to 930; 
and (2) nucleotides 931 to 1309; (3) nucleotides 1406 to 1545; (4) 
nucleotides 1956 to 2256. 

34. A kit for detecting alteration of wild-type APC genes 
comprising a Ijattery of nucleic acid probes which in the aggregate 
hybridize to all nucleotides of the APC gene coding sequences. 

35. A method of detecting the presence of a neoplastic tissue 
in a human, comprising: 

detecting in a body sample isolated from a human alter- 
ation of a wild-type APC gene coding sequence or wild-type APC 
expression product, said alteration indicating the presence of a 
neoplastic tissue in the human. 

36. The method of claim 35 wherein said body sample is 
selected from the group consisting of serum, stool, urine and sputum. 

37. A method of detecting genetic predisposition to cancer, 
including familial adenomatous polyposis (FAP) and Gardner's Syndrome 
(GS), in a human comprising: 

detecting a germline alteration of wild-type APC gene 
coding sequences or their expression products in a human sample 
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selected from the group consisting of blood and fetal tissue, said alter- 
ation Indicating predisposition to cancer. 

38. The method of claim 37 wherein the expression products 
are mRNA molecules. 

39. The method of claim 38 wherein the alteration of 
wild-type A PC mRNA is detected by hybridization of mRNA from said 
tissue to an A PC gene probe. 

40. The method of claim 37 wherein alteration of wUd-type 
A PC gene coding sequences is detected by observing shifts in 
electrophoretic mobility of single-stranded DNA on non-denaturing 
polyacrylamide gels, 

41. The method of claim 37 wherein alteration of wild-type 
APC gene coding sequences is detected by hybridization of an APC 
gene coding sequence probe to genomic DNA isolated from said tissue. 

42. The method of claim 41 wherein the APC gene coding 
sequence protse detects a restriction fragment length polymorphism. 

43. The method of claim 37 wherein the alteration of 
wild-type APC gene coding sequences is detected by determining the 
sequence of all or part of an APC gene in said tissue using a polymerase 
chain reaction* deviations in the APC sequence determined from the 
sequence of Figure 7 stiggesting predisposition to cancer. 

44. The method of claim 37 wherein the alteration of wild- 
type APC gene coding sequences is detected by identifying a mismatch 
between molecules (1) an APC gene or APC mRNA isolated from said 
tissue and (2) a nucleic acid probe complementary to the human wild- 
type APC gene coding sequence, when molecules (1) and (2) are hybrid- 
ized to each other to form a duplex. 

45. The method of claim 41 wherein the APC gene probe 
hybridizes to an exon selected from the group consisting of: 
(1) nucleotides 822 to 930; and (2) nucleotides 931 to 1309; (3) 
nucleotides 1406 to 1545 and (4) nucleotides 1956 to 2256. 

46. The method of claim 37 wherein the alteration of wild- 
type APC gene coding sequences is detected by amplification of APC 
gene sequences in said tissue and hybridization of the amplified APC 
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sequences to nucleic acid probes which comprise A PC gene coding 
sequences. 

47. The method of claim 37 wherein the alteration of 
wild- type AFC gene coding sequences is detected by molecular cloning 
of the A PC genes In said tissue and sequencing all or part of the cloned 
A PC gene, 

48. The method of claim 37 wherein the detection of alter- 
ation of wild-type APC gene coding sequences comprises screening for 
a deletion mutation. 

49. The method of claim 37 wherein the detection of alter- 
ation of wild- type APC gene coding sequences comprises screening for 
a point mutation. 

50. The method of claim 37 wherein the detection of alter- 
ation of wild-type APC gene coding sequences comprises screening for 
an Insertion mutation, 

51. The method of claim 37 wherein the expression products 

are protein molecules. 

52. The method of claim 51 wherein the alteration of 
wild-type APC protein is detected by immunoblotting. 

53. The method of claim 51 wherein the alteration of 
wild-type APC protein is detected by immunocytochemistry. 

54. The method of claim 51 wherein the alteration of 
wild-type APC protein is detected by assaying for binding interactions 
between APC protein Isolated from said tissue and a second cellular 
protein. 

55. The method of claim 54 wherein the second ceUular pro- 
tein is selected from the group consisting of MCC protein, wild-type 
APC protein and a G protein, 

56. A method of screening for genetic predisposition to can- 
cer, including f amiUal adenomatous polyposis (FAP) and Gardner's Syn- 
drome (GS), in a human comprising: 

detecting among kindred persons the presence of a DNA 
polymorphism which is linked to a mutant APC allele In an Individual 
having a genetic predisposition to cancer, said kindred being 
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genetically related to the individual, the presence of said polymorphism 
suggesting a predisposition to cancer. 

57. A preparation of the human APC protein substantially 
free of other human proteins, the amino acid sequence of said protein 
corresponding to that shown in Figure 3 or 7 (SEQ ID NO: 1). 

58. A preparation of antibodies Immunoreactive with a 
human APC protein and not substantially immunoreactive with other 
human proteins. 

59. A method of testing therapeutic agents for the ability to 
suppress a neoplastically transformed phenotype, comprising: 

applying a test substance to a cultured epithelial cell 
which carries a mutation in an APC allele; 

determining whether said test substance suppresses the 
neoplastically transformed phenotype of the cell, 

60. The method of claim 59 wherein the cultured epithelial 
cell has been genetically engineered to carry the mutation in the APC 
allele. 

61. A method of testing therapeutic agents for the ability to 
suppress neoplastic growth, comprising: 

administering a test substance to an animal which carries 
a mutant APC allele In its genome; 

determining whether said test substance prevents or sup- 
presses the growth of tumors. 

62. A transgenic animal which carries a mutant APC allele 
from a second animal species in its genome. 

63. An animal which has been genetically engineered to con- 
tain an insertion mutation which disrupts an APC allele in its genome. 

64. A cDNA molecule which encodes a protein having the 
amino acid sequence shown in Figure 3 or 7 (SEQ ID NO: 7 or l). 

65. An isolated DNA molecule which encodes a protein having 
the amino acid sequence shown in Figure 3 or 7 (SEQ ID NO: 7 or 1). 

66. A yeast artificial chromosome which is known as 37HG4. 
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TABLE HA 

Germllne mutations of the A PC gene in FAP and GS Patients 



EXTRA-COLONIC 
PATIENT CODON 
DISEASE 



93 
Ostaomd 

24 

34 
Tumor 

21 

Osccoma 

60 
Ostaoiu 



279 

301 
301 

413 

712 



NUCLEOTIDE AMINO 
CHANGE CHANGE 



TCA->TGA 



CGA->TGA 
CGA->TGA 



CGC->TGC 



TCA->TGA 



Ser->Stop 

Arg->Stop 
Arg->Stop 

Arg->cy» 

S«r->Stop 



ACID 



AGE 



39 Handibular 

46 None 

27 0«scDoid 

24 Handibular 

37 Kandlbular 



3746 
3460 
3827 
3712 



243 
301 
456 
500 



CAGAC->CAC 
CCA->TGA 
CTTTCA->CTTCA 
T->C 



splice- juncc ion 
Arg->Stop 
fraoMshlf t 
Tyr->Stop 



* The mutated nucleotides are underlined. 
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TABLE HB 



Somatic Mutations in Sporadic CRC Patients 

lAnEhTT CODQN' ^JirrLEOTIDE CHANflF AMINO ACTP CHANr.F 

r35 MCC 12 CAC/ttun-> (Splice Donor) 

CAC/ttaaaa 

TI6 MCC 145 ctcat/CGA-> (Splice Acceptor) 

Itcat/GCA 

T47 MCC 267 CGG->CIG Ar|»Uu 

Til MCC 490 TCO->TIG Ser.>Uu 

T35 MCC 506 C0G->CAG Arf>Gla 

T91 MCC 698 GCT->GIT Ala->Val 

T34 APC2SS CCAGT->CCCAfiCCAGT Onsertion) 

-n7 APC331 CGA->IGA Arf>Stop 

T13S APC437 CAA/ttaa*>CAA/t£aa (Splice Donor) 

7201 APC I33S CAG->IAG Cln»Stop 



For splice site mutations, the codon nearest to the muutioa is listed 

The underlined nucleotides were muttnt; small case letters represent introns. large case leners represent ^-nr 
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TABLE m 



S#qu#nc^< of Pntn^n Us#d for SSCP A/\Mtym 



991 



I «*^<7rcAACftcc7CTCArccrec 

• v^AcccArQcreccTcrrrm 
I TT->riTTfcrrrir>Mi i i nn u 



— rrrrnrrtrrurniu 



I 

I 

J 

< 

f 



mi* 

qy-t rnrw i Tr i MuuJiiwuy 



I 



f 

r 
J 
I 

.1 
i 
J 




PWtCTTrrtCCTCTTBC 



intBirrnnrrntu n iiuj 



O^^-^mtAOCATACACAnMUtf 

v^AcncMeAf^eooutimM* 

*^**"^niTirTii-|<-ri mm i 



LiiiiJUiiXUkXAACAfVAAtf* 




AAAmAa* 

W > Mil f I CtMTtTT fU^TO JCf 1 * 
**"^TriCTC^PCAfTAt!ATTTnLTQCCA« 
TTAfAtfTBAACB* ft^-AtfCTCAMAtfAAAaASVAflUM* 

^ Vy-ATCroeCTCCAAAAmOTK* IMCCAttrTOCA^ACTTTCWW. 

•^*«TAAAtCCTCCACTTCACA«d« — 1 TITriTlTlll 

«^-«*CCC?CATCrCT»tf?TCr9C* 
l»*rTgTOCTA»CCTTm cAlM 

I ~-irrriiTrrmTBri iim njuui 

» W»-AACATCA€CrCT?CCAC4AAW 

TMTAAaTMTTTA^AraUM* I H ri (Kf | CI f Ul ACtTOtf^A* 



ft^ArmctAArrrzTCAraktfXTac* 

W-CTTTTTTOCfiAt 



•^•^TTCACTCCCCTACtAAacfctf • 



•« ff^-ocTCTvercaeoeAAAcyc* 

Vy-fOCTJUmCJkCCQLArAAAAMS* 
•t ■ > * m!WlHA TC r jyCACAn UiAT> * tP»4 

• 4rrrrcrr Atfi— cAtTcrrccTC 



79* r%rr i r t,fcAATAA rc et c i ct . 



p«tr fU« r ol tA« cxoA It ftmo4m«s: tn« t#eo/^<s 3hm#r fl« y of tho «»c 
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TABLE IV 



S«v«n Oifftr«nt Vtrsions of th« 20-Amino Acid R«p««t 
Consensus: F'VE'TP'CFSR-SSUSSLS 

1262: YCV60TPI CFSRCSSLSSLS 

1378: HYVQETPLMFSRCTSVSSLD 

1492: FATESTPOOFSCSSSLSALS 

1$43: YCVEQTPI NFSTATSLSOLT 

1844: TPI EQTPYCFSRNOSLSSLO 

1953: FAI ENTPVCPSHNSSLSSLS 

20':3: FH 7 60TPVCFSRNSSLSSLS 

Numb«rs dsnete p« first amno aod of «ac.*) rB0«aL Tha coriMnaua 
^e^uenca it t^• tsp f«nac:s a rrajonty amno ae:d at a givan position. 
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lnUrna(lon4l Appllc«1lon No: f 


cHEC'D ) 6 MAR 1992 


MICROORGANISMS 

OpU*n*l In c»'tA*<ll«A U M9« --22 23. 


WlPO PCT 




• 1 |h« ^••crfpUM 1 





A. lOfKTiriCATIO*! Of DCPOSIT • 



NATIONAL COLLECTION OF INDUSTRIAL AND MARINE BACTERIA (NCIMB) 







Aberdeen AB2 IRY, Scotland 




United Kingdom 






17 December 1990 


NCIHB 40353 





Saccharomyces cerevisiae SC/37He4 



In respect to those designations in which a European Patent 
is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the 
European patent or until the date on which the. appl ication has been 
refused or withdrawn or Is deemed to be withdrawn, only the issue 
of such a sample to an expert nominated by the person requesting 
the sample. (Rule 28(4) EPC) 

C. DtllCKATEO •TATl« rO« WHICH IMOiCATlOXt AftI WAOI • <i« IK» l*4i<«iIo«« t/« m*« .a 4»*t^*^%\04 SUU«) 




11 Rec'd PCT/PTC 0 2 APR 19E 



lnl«m«tioA«l AppUcctioft No: PCX/ 



MICROORGANISMS 



A. lOtWTinCATlOW Of Of^O«IT« 



NATIONAL COLLECTION OF INDUSTRIAL AND MARINE BACTERIA (NCIMB) 



A*i««#-^-«.'»»«tf.«»«-o*^i-»««»^«^«^*— 23 St. Machar Drive 

Aberdeen AB2 IRY, Scotland 
United Kingdom 



17 December 1990 



NCIHB 40353 



Saccharomyces cerevisiae SC/37HG4 . ^ 

For the designation of Denmark, the applicant request that the 
sample of the microorganism only be furnished to an expert until the 
application has been accepted or fiJially decided without having been 
accepted (Section 22(7)). 



wo 92/13103 



PCT/L'S92/00376 




SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



2/48 




JO 



CO 



CD 



cx> 
in 



CO 

<£> 

ui 

cn 
to 



CO 

iri 



CO 
GC 
LU 



o 
o 

o 

CD 
CVJ 



CD 
LO 
CO, 

OJ 
UJ 

CM 
CO 



CD 
O 
CO 



CD 



O 
<C 

>- 



Y 



OQ 



CO 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



3/48 



CO 



in 

LU 



CO 



o 
O 



LO 
LU 



O 
Q- 



cvj 
CD 



.^1 



I I 



CO 



=Llu 



O 

O 



o 



CvJ • 

LO 



1^ 



CO 
LU 



LO 



CO 

cc 

LU 

CC 

<: 



CO 



CO CXZ 



CD^ 
LU CD 

— » CD-4 
CD 



— ^ 
CT> LU 

UJ OC 

n: 

OC 



CO 
CO 

CO 

o 

LU 



CD- 



CD- 
CD' 

CD- 
CD" 



CD -J 

CD- 
CD- 



JQ 

o 
O 

CO 



o 
eg 



CD 
CO 

CD 

CO 



CD 
CO 
CO 



UL CD- 

CD- 



CO 

c\j 
LU 



CO 
CO 



o 

CO 
LU 
CO 

CVJ 



o 

CO 

eg 

CD 



to 3c 

CO CO 

O o 

CD O - 

CO CVJ 

eg 

o 



.-J 



LU 



Y 



J 



CO 

t 



S3 



SUBSTITUTE SHEET 



wo 92/ 1 3 1 03 PCT/US92/00376 



4/48 




CffOQTfTiiTr curc-r 



WOf 92/13103 



PCT/US92/00376 



5/48 




cMPQTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



6/48 



<5csjoo^OvocNjeo'*o^cvioO'^ovoc>S^«sj«Ni»nj3>^ 



oe o ui 



to 




o 



O Q I 

-J CD 

z q: 

S 1^ 

3: z 

^ O to 
t/> U- 



01ljJO>->-LL.Zt/>3:C^ 
" OOTftS^^^P 




^ o «f ck: 

on>^cx.^z.JUJZ<Oa-44/>a. 
>"0«CJ^-Z*COZ5i^O-J4>^0 

UJ^ZZ</)Z^^Z><X.^UiZ 



o uj ^ a: 



i/> o 



< UJ oc 

< </> UJ 



S o z ^ u 
o Ui :c o z a. 

CD UJ 



S UJ ^ 3 



H- i/> i/> O Ui 

Z -J <^ o »— 

H-ZOZ««i/>>-ZC>>- 

03a.tta.i/>4/>oa:Si^<oui> 

</^^inZZC^UJi/>i/)H4ZUIV04/> 
>*Ol--«OZ«JWOVO>*>-4/>i/)«J< 



o z 

»— Z Q >- 

o i/1 ^ a 
>► > o 
-J ^ >- 

Lu I— «X ^ 

i/> ot: a 
< z z ^ 



u. o 



< ex. CJ 
^ z ^ oc: 
o a: z o 

1-4 Ui O 4/^ 

o o c z 
oc I— qe: J 
UJ > ^ O 
a: o u> Ui 
Ui z >- Q 
^ ^ o u) 



Z €1. i/» i/> M 
i/> J i/» 

O 4/> O 

Ui Ui z ^ «J UI 

O Ui Ui 4/) lA. 
O Ui VO i/) CO c/> 
O Ui CO O CO o 
U. Z t>0 bi^ ^ 
Z O 4/> 4/> ^ OO 



Oi/)«CZ4>OUiO>- 
*CZZb^^O^COCO 

Cida:oouio>>- 

►HJJCdV^4/>4/>Z 

o«-i<czq:coz»h 
<r i/> CO a J a. 
a < *t; oo Q c^ h- 

Z^VO^OUiOO 



a. 
O 



Ui z 



^ oo 



U. OC CO 

u ex. > 

l-f Z CO 

a. O H- 
iO O 
Z CO o > 

^ Ui VO Ui < CO 
O OO Q > Q» U. 

Q >- c5 o > ac 

h- Zi/>>»UiZj 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



7/48 



to' 



oooooooooo 



OOOOOOOOOOOOOCSJ 

CsJCVIfSJCVJCNJCVJCSJCNJ 



Ovoc\JOO^Ovocsj<o^O\Oc\JCO^O 



O O ^ c3 3C 

^ a. M — 



O 

-J 

CK^ •-^ UJ I 



CC UJ 




So Of ^ 
^ LlJ 
> Q 

a: 



to 




< ^ o o. 
< o o. <c a i/) 

> a S uj o 



H> < Ui o 

> >^ U. > L&J i/> 



^ ^ a. a, _ 



^ ^ Ck. O i/l i/> CO S CD o ^ 



CO 



_.5^aci-i2acco^^ocooocoooOiIj5 
c3cx-Q.co»--i/>OJ2:^->-SSocoCoi^ 



oc: o 



UJ CO CO o 



O CO 

o z a. > 

^ UJ ^ CO 



l-H Ul UJ V) 



M -J w 



co<c>»^o£<fa.coujHHc/> 



y0^iiM3EOcozW^uiO<l--3C-<CcoSooe:P-5-J^^ 
uj<^ujoH*co^uicoo^ouia:2^cococoei>o>co 
Lu5^>-^uiou>vo,jotfcoo«jac-J>co>-,jc^a.coo>H-o 

^ZLfc.*-JO0ir<ZC0OC0Qf^ft— OC0c0UiH-C0l-HLLi»»OC0 

o\^cx.iHQ.ocoo^a.ococococco<>a.uJcOH-c^cox 
-JcuuJUJh-uJ^^QiA.<i.ck.co^i^CiP52cocozC^a- 
Q Uu H- O ^ 



CO > 



a. 



O CO CO 

CO < oc 



^ 2: a. q: 
CO a. CO ^ 



c/>uj«jococL.oicoacouJ^KHOi-40a.za.co 
>«voo>i--i>oooH-^coa.zcoco<c:k:a: 
co^*jQ3CH-o:^oguj^35^k-i— coo 



CJ C3 i/l i/> 



^ > CO a. 

<^ VO 3C i/> 

_^UJi^Zi^h->— COi3cO<UJQ.O^O 

zpcJ:^a.<-jQ:oi-zcoi-iQ<2coh- 

_ ^ ^ ^ , ._ooQi«iOL.^ijJttar5a.zco3:co3>coco 

J^t^J^R^!^^ie-^^iii*^^^*^>--J3CJQ:ii^>-Zcoco 

3u.acoj^u.w>co^zcoHHOH^a.a3:co^<x:>-i3couj 
a.co^«uj>oCi/ia,uji-^^^cococodui<S^Scn 



< Q ^ O 

_ O DC Z 

tf<^aq,S»Hcoo 



Cl .J ^ 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



8/48 




SUBSTITUTE RHPCT 



wo 92/13103 



PCT/US92/00376 



9/48 




RffRj^TITllTP currx 



wo 92/13103 



PCr/US92/00376 



10/48 



rH H x: r- < >i oj H 0) r-^ H 



to- 



O 3 



o 

O 



> 



o 

Eh 
O 



> 



O E-« <U 

O O U J 



U 0) 
Eh £ 
Eh 0. 



O >i O 3 

O <-H Eh <U 

O O O iJ 

O J3 < >> 

< Eh Eh Eh 



U U O U 



O 
O 



(0 



<: w 

o •< 

O 0) 

Eh x: 

Eh CU 



< >i O CJ^ O 3 
O O Eh 0) 



< 3 O 3 
Eh <U 



3- 



00 

CM O 



Id <N o 
00 O 



< 



o 
u 



U 04 

EH >^ 

< >i 
Eh H 

O .-H 
Eh BJ 
O > 

< O 

O i4 

O PU 

EH 10 

O rH 
O < 

O (0 
U rH 
O < 

O .-H 

Eh 10 
O > 

Ka: 10 



s 



o 
o 

Eh 
O 



(0 O 
•—I o\ 

«< rH 

3 
►4 



o <0 

o < 

O 10 

o <: 

Eh U 

EH C/) 



frt 3 
£h <D 



O 
H 

O 
U 
O 

O 
O 

U A* 

U iH 
Eh 10 
O > 



E^ 
M 
2 

«0 



O 



o 

Eh 
U 

u 
u 

Eh 



to 
< 

x: 

Eh 

Eh 
M 

(0 
>i 
U 



33- 



o 

Eh 
Eh 

U 
U 
O 

O 
H 
O 



3 
10 



(0 

> 



O 3 

Eh Q) 
O 

< >i 

o ^ 
o o 



U ro 
O rH 
O < 

< O 

u u 
u cu 

a v< 

< >i 

Eh Eh 

U >i 
e> rH 
O O 

Eh <U 

O rH 
O O 

<: 0) 

>< M 

O 3 
Eh <y 



u 

Eh 

o 



0) •'T 
rH «T 



<0 

> 



Eh >, 
O 

O O 

EH 3 

Eh 0) 

O ►J 

Eh 10 

O rH 



o 

Eh 

o 



0) 



0) 

a, 



u 

£h 



C 

*< eg 
(A 



O 3 

Eh 0) 
U 

O 3 

83 

Eh >-i 
a Q) 
Eh to 

U <0 
O rH 
O < 

< >i 

o o 

Eh >-( 

Eh Eh 

Eh >i 
O rH 



Eh Eh 




14; in 0 0) 
(< < •< to 



00 U 
0\ O 



CM 0 0) 
10 Eh X 



U n Eh 04 



Eh >^ 
O 0) 
rt: to 



O rH 
Eh <0 
O > 

Eh >^ 

O rH 

O O 
Eh Jh 
Eh Eh 

O rH 
Eh 10 
O > 



U 0) 

Eh to 

U Q) 

£h .-H 



< 
H 
O 

tJ 

Eh 
O 

< 



10 

> 
Q4 

)h 
Eh 

U 
>i 
Eh 

x: 

Eh 



O 3 
Eh Q) 
Eh ,J 



Eh 
U 

U 
Eh 
Eh 



3 
0) 

0) 



O rH 

O ID 



Eh 
t5 
Eh 



(0 



o w 

53" 



O 
Eh 
O 

O 
Eh 
< 



3 

Eh 
M 



EH rH 
«< M 

o cr< 

O >H 

o w 
< >1 

Eh Eh 

O 3 
Eh 0) 
O ►J 

O 3 
Eh fl) 

S3 

o o 

Eh <0 
O rH 



O >H O t3 >l 
< >lU> CD rH 

Eh Eh O O 



U Vh 

Eh Eh 

O 0) 
Eh 

Eh CX4 

O O 

U U 

O 04 

O <U 

Eh x: 

Eh fU 

O & 

Eh Eh 

< >H 

U Q) 

Eh to 



3< 

Eh >^ 
O 0) 
Eh to 

Eh O 
O U 

o o< 
o u 

t!) Q) 
<< to 

O O 
U ^^ 
U O4 



tJ 
Eh 
O 



3 
0) 



Eh 4:: 
Eh O4 



O 
O 

tD 

o 
o 

E-i 



(0 



Eh 
M 
S 

(A 

u 



u a 

< tn 
t3 < 

O CO 

33 

O rH 

Eh to 
O > 

CD rH 

Eh «J 
CD > 

Eh U 
O <U 

< to 



< 
CD 

CD 
Eh 
< 



(0 



ID C 

i< rH 

U CD 

^ a >H 
in Eh to 
CD 3 

< rH 

CD CD 

u m 
u X 
o w 

S3 

CD 3 
Eh <U 
U »J 

U O 
Eh £ 
Eh O4 

U <D 
Eh ^ 
Eh O4 

Eh O 
U U 
U O4 

Eh CP 
O »H 
U < 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



il/48 



<r> U rH 
in o < 




33- 



o 
o 



o 

< 

< 

< 
Eh 



o 



Eh 

o 

Eh 
O 
Eh 



O 
Eh 



CO Eh ttfM 
VO #< W CM 
»n O < ^ 



10 




u u 

O Q) 
H CO 



3 

U 
< 



w 
>1 





O O 


O Eh 


O Eh 




n O 


O O 


<r» Eh 


O Eh »H Eh 


o 




rH CJ 


< 




£h 


Eh 




o 


Eh 






Eh 


£h 




O 


O 


Eh 


< 


U 





o 



Eh 

o 

Eh 
Eh 
O 

3 

Eh 



O 



O 
O 
< 
Eh 
Eh 
Eh 
U 
< 
Eh 
< 



O 
Eh 
Eh 
Eh 
O 
8h 
£h 
O 
O 
Eh 



o 

CO 




SUBSTITUTE SHEFT 



wo 92/13103 



PCr/US92/003f6 



12/48 




EHOrt:oHO<0*<OE-«OUOE-«OCJO*j; 


o in o 


CM 0> O VO O 






O Eh r-- O 


2 .H < 


(rf in F-4 tr> 




vo Eh 






COEhOO<0^<4:00 


Eh r-< O 


T-l »< rH 




r^ o 






r^ u 


rH Eh 


rH O f>J < 


u < 


o 


o 


Eh 




o 


Eh 


O 






< 


Eh 


U 








Eh 




o < 






O 




o 


Eh 


S 




< o 




o 








O 








Eh 


< 


o 






Eh 




O O 




o 




o 




Eh 


< 


u 


rt: Eh 




CM 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



13/48 




SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



14/46 



;5 





a 


in < 




V.-/ 




Eh 




< 






r . 


< 






Q) 








• V 




rH 




< 






u 












U 




O 










W 






O 












< 


o 


u 






Pm 






CM #< 


• 










o 




U 


O 


<! 




3 




o 


o 


o 


1— 1 


Eh 




U 




O 




< 




V-/ 


V-/ 


Eh 


c 










< 






rH 




O 


t> 




Eh 


Q) 


O 


»J 




>i 




f-\ 


u 





CO 

o 

iH 



fH 

00 



Eh 
Eh 



EH 



% 



% 

a 

Eh 
O 



% 



c 


rM Eh C 


rH 


V£> <e£ (fl 


Ih 
>-/ 


iH ltd ftC 






^^ 


a <i) 


**% 


CH LA 




< <o 


0) 


u ^ 


iH 




c 


O 3 






< 


o o 








u x: 


cn 










#<5 rH 


-< 


o o 






iH 


H 0) 


O 






cn 










(0 




>1 


u ^ 








in Eh ::5 


0) 


m H <D 


>-] 




<d 


E-( C/) 


rH 




< 


U 53 






Gl 




tH 


o u 




U 0) 


> 




c 


Eh C 


iH 




O 




to 




>1 


cn 


•-3 




eu 






U O 




< :5 


0) 


H 0) 






c 


o o 


«H 


kC .h 


o 


o o 



VO Eh <a 
tH u r^ 
<N o < 

O H 
Eh W 

< s 

Eh <d 

o < 

H 

o o 

Eh a 

< V) 

o < 

H 

o o 

Eh 0) 
Eh rH 

< M 

Eh 

O 0) 

< W 

< >i 

O rH 
O O 

<«i C 

CO < <H 
rH U O 

< :3 

Eh 0) 
O h5 



O U V< 

eg < cn 


^ < >. 
(N O rH 

O O 


Eh a 
< w 
O *< 


Eh >h 
Eh Eh 


< :s 

Eh 0) 
Eh iJ 


Eh 

O Q) 
Ph CO 


u c 


O ^1 


H Q) 

r \ . 1 


o ^ 

»-l 


O 3 
< 

r ^ r f> 


O U 
U Q) 

CH UJ 


< U) 

< >i 
1— I 


O E-t 
Eh W 


H D 
H 0) 
O 1-^ 


33^ 


O U 
O << 


< M 
O 0) 
Eh CO 


< «H 
CM C!) CD 


O cr> 

o\ O »H 

CJ O 


< 3 

Eh 0) 

fcH t— 1 


O 0 
Eh 0) 


Eh Q) 
cH M 


Ks; ta 
>i 

^ . "1 


E-« a 
< w 

**** 


< 

Eh rtJ 
CD !> 


Eh Q) 


< >i 

O rH 
O O 


o c 

< iH 
f 1 r 


Eh O 
O U 

f 1 fL. 


< >i 

O rH 
O O 


O 0) 
Eh 


Eh 

U 0) 

tH W 


Eh C 


Eh 

O Q) 
Eh W 


Eh 

< W 



SUBSTITUTE SHEET 



, wo 92/13103 



15/48 



PCT/US92/00376 



to' 



^ < >^ 

oj O 
n O O 



O Cr> 
0\ ID U 
<v> o < 



Eh 

o 



0) 

c 

0) 



CO O E-« 
H W 
o rt: S 





Eh O 


< >, 




r 1, f » 






Eh 


a <u 


Eh <0 


Eh CO 


0 > 




Eh 0 


O >H 










CH >-| 


E-I Q) 


0 0) 


U 


<: to 


U )>4 


0 u 


O d) 


0 >i 


H W 


Eh U 


O Eh 




Eh W 


< «H 


< S 




<< (0 


< >, 


1^ 








< ^1 




U 0) 


0 0) 


Eh W 


Eh CO 



«H Eh 

in o ^1 
n O < 

<< CO 

< u 

O 0) 
Eh CO 

•< r-l 
O > 

a <u 

Eh CO 

< 

O O 

^ rH 
O CJ 



o 
u 

a 



u 
< 

u 

0) 

to 



<N Eh >-i 
m < >^ 
Eh Eh 


00 < rH 
^ 0 CD 


0 < <U 
£h rH 
in ^ t-1 


< >. 
Cj) CD 


< Ui 

< >1 
<< »J 


< o> 

< < 


Eh M 
< Eh 


u a 
0 < 


< 


Eh U 
CJ 0) 
i< CO 


Eh 3 
Eh 0) 


Eh ^ 

0 x: 
< en 


0 0 


Eh 

< CO 
0 •< 


0 P 
H 0) 


0 ^ 


Cj irt 
CH lU 

0 ^ 
0 < 


cH C 


u >^ 

0 0) 

#< CO 


Eh P 
Eh <U 


0 c 
u 0 


^ >^ 
0 0 


|h s 


£h P 
Eh 0) 


Eh C 


Eh 0) 
Eh J 


35 

0 0 



in •< fH 

O H (0 
""T o > 

Eh Q) 

£h z: 

£h 



rH 



O C5 



o 



CP 

u 
< 



< o 

U O4 



Eh 
Eh 
Eh 

< 
U 
Eh 



0) 

U 
CO 



Ci) 



in CJ <U 
^ CO 

ID U 
< < 



<n Eh io 
iH O «-i 
tn C5 < 



in 



in 



SUBSTITUTE SHEET 



wo 92/13103 



PCr/US92/00376 



16/48 



O CP 

u u 
< < 



< 



< < 



o 
< 
o 

EH 
EH 



u 
<1) 



< u 
o x; 

< 6h 



3 



a o 



< 

E-« 

EH 

o 

E-« 

EH 

Eh 



3 
<V 
•J 

M 

CO 

(U 

04 



Eh C 

33 



3 



U 

O X! 

< 3 
Eh 0) 
H *^ 

H O 

U P4 



Eh 



0) 



Eh 

m: CO 



a 

(A 



CO Eh >i 
''i' O <-H 
vo O 

Eh Q) 
U 

C 
H 

u o 

■H 

p 

-H 

o o 

O Eh 
Eh W 

< S 

O nJ 

c5 rt: 

Eh .-h 
Eh <0 
O > 

< C7» 
O »H 

< 

U Q) 

fNj Eh rH 
VO •< M 

c 

-J 
U CD 

O CP 
C3 ^1 

< < 

o >-* 

33 

O CD 
EH 

33 

C5 C3 
O ZJ 
Eh iJ 

u o 



O < rH 

U O 



o c 

u o 

Eh 0) 

Eh rH 

i< M 

< CT* 
15 U 

< < 

O Id 

o << 

< 0) 

< M 

< tr 

< < 
< 

CJ >-i 
U 

o c 

< ^ 

in <: rc 

r- O rH 

vo cj) •< 

C5 U 




3 



Eh 
M 

a 
w 
< 



(J) 

Eh 

U 
U 



o 
x: 

Eh 



in < «H 
C5 O 



< 

U 
C5 



V4 
Eh 
id 



u o 
u 

Eh to 

C3 C 
< rH 
O O 

1:3 



Eh 
Eh 
U 



3 
0) 



o c 

< rH 

O C5 
<y\ *< 
r- O < 

< 0) 
Eh rH 
•< H 

£h 

C5 U 
O < 

U 

< Q) 

rH 

< M 

a (X 

< V) 
C5 < 

O W 

•J 

33 

O O 

O Q) 
H rH 

< H 



O O CJ> 
rH O M 
CO U < 

O 3 

o o 

Eh Id 
O ^ 
O < 

Eh a 

< W 
O < 

Eh W 

i< -rH 

O X 

O 0) 
Eh to 

O >i 

O rH 

C5 O 
O U 

o x: 

< Eh 

CD t5 

o Eh W 
00 -H 

r- U K 
o w 

33- 
33 

o c 

< rH 
O C5 

Eh U 
O 0) 
Eh CO 

< U 
O 0) 
Eh CO 



CD 
CD 



CP 

u 
< 



< rH 

CD O 



(d 



Eh >i 
vo CD "H 
CO CD O 

Eh C 

«, «. 

Eh >, 

O O 

H iH 

Eh CO 

Eh U 
CJ JC 

< Eh 

•< <d 

O rH 

CD < 

O Eh 
Eh U 

< S 



U C 



5" 



u <u 

r- < 3 

n < rH 

CO CD CD 

< >i 
CD <-t 
CD CD 

CD rH 
Eh flJ 
CD > 

< >i 

CD rH 
CD CD 

33 

O O 

Eh >i 
CD rH 
CD CD 



3 

CD 

3" 



CD CD 



CD C 

< rH 

O CD 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



17/46 



SO" 



CO E-* U 
a\ < Ui 

O fl) 
< CO 



3 



£h (0 
O > 

Eh ^< 
O 0) 
< CO 



O 
O 
O 

s 



(0 



u 

6-1 



35 

cr» < -H 
eo O tC 



O 
Eh 

a 
< 
< 

Eh 
O 

O 
H 



Eh 
U 
S 

< 

u 

XX 
Eh 

<U 
CO 



Eh >, 
O C 



rH 

o> O O 

O 'H 
Eh <0 
O > 

o to 

O U 

o x: 

< Eh 

< >, 

O U 

£h Q) 
CJ iJJ 

6h to 

O X 

Eh 

O 0) 

< CO 

< U 

a ^ 
in t5 z> 

Eh <U 
<J> O ,-3 

O CP 

< < 

CD >H 
U < 

Eh O 
O U 

O r-K 
O < 

Eh V< 
O 0) 
Eh CO 



< 

o 
o 



•H 
}H 

(O 



vo t5 

CM O 

o ^ 
H 

o 



<U 
CO 



O O C 
CO < iH 
O U CD 



n CJ OJ 
*H Eh to 



< CO 
C5 < 

E-« a 
•4; CO 
C5 < 

O (A 

< (0 

o < 

Eh to 

< -H 
CJ K 

Eh >-i 

< Eh 

Eh >, 

O C5 

cr» Eh 3 
0\ Eh Q) 
<^ O ►J 



O 
Eh 
< 

g 



Eh 
O 

to 



o ^ 

Eh <U 
Eh 



E^ 

Eh 

Eh 

C5 
Eh 
C!) 

C5 

< 



U 
0) 

to 
»^ 

Eh 



(0 
> 

Eh 
M 



< 

C3 
U 

C3 
Eh 

O 
CJ 



0> 

Eh 
W 

0) 
CO 

o 



^ «H 

< M 

£h to 

Eh U 

O >^ 
C5 0) 

< CO 

CJ a 

< to 

CD < 

If) a 0) 

O Eh CO 
fH 

o u 

o <u 

< CO 

Eh U 
CJ <U 
Eh CO 

O Eh 
Eh M 

< s 



Eh 
U 
O 

•< 

Eh 
O 

O 

£h 

Eh 
O 
< 

< 
C3 
O 



(0 



<D 
3 

a> 
u 

XX 
Eh 

D» 

< 



Eh W 

< -H 
O K 

< P 
Eh <U 

Eh 3 
Eh 0) 
O h3 

r> C5 C 
O < iH 
H O O 

rH 

O <U 

p 3 
Eh 0) 
O .J 

CJ 3 
Eh 0) 
O 1^ 

Eh O 

CJ 0^ 

Eh 3 
Eh OJ 
U 1^ 

Eh W 
O >, 
Eh O 

< >^ 
CJ) ^ 
C3 O 

Eh >-i 
U Q) 
Eh CO 



00 i< 
CO CJ 
•H O 
iH 

Eh 
C5 
< 

O 
U 
C5 

CD 
C!> 
< 

CJ 
CJ 
C5 

O 
C3 
O 

Eh 
O 
CD 

C5 
< 
C5 



•H Eh 
\0 C3 
fH 
tH 

O 
O 

CD 
CD 
U 

O 
CJ 
£h 



< 

CD 
CD 

O 
Eh 
Eh 

CD 
Eh 
£h 

< 
£h 
CD 



SUBSTITUTE SHEET 



PCT/US92/00376 



18/48 



< f-i 
<M O O 



O CP 

< < 

e> o 

-< tp 
o u 

< < 

< {/) 
O < 

H & 

< (A 
O < 

E-» O 

U CU 

IT) O C 
rH leC fH 

o) U O 

tH 

U 0) 
E-< W 

U (0 

a w 
^ 

O O 
H 

< M 
O C 

:^ ^ 
«. «, 

U U) 
-H 
O E 

o :3 

O ►J 



10 



\0 O 
H 

Eh 
O 



a 

Eh 

(A 
>i 
U 



< Eh 

o o 

E-i (/) 



Eh 
(0 



EH 

O < 
O CP 
U < 

< <U 
Eh ^ 

< M 

cn O C 
vo < <H 
o» U O 



33 



o 

Eh 
Eh 

Eh 
Eh 
O 



3 
0) 

0) 



Eh in 
< -H 



EH 

Eh 
U 

U 

O 

< 

o 



3 
0) 

> 

tp 

< 



U 0) 



o < o 
tn o 
n O 



O 
< 
< 



Eh 
M 

O 



< tn 
e> < 

o Eh 
Eh M 

< s 

<n O >i 

(NJ O 

n O O 

< o 
o o 

Eh M 

< -H 
O X 

Eh 10 

O rH 
O < 

o o 
o c 

< r-H 

u o 
o a 

Eh Eh 

O 3 

< t-i 



O U 0) 

<<f H W 

Eh 3 

Eh <U 



<0 



r- H O 
r- U V< 
o U Ou 

Eh W 

o >. 

O <U 

EH rH 

o c 

Eh W 
«< 

O X 



3 



EH 

O 

Eh 
U 



(0 
> 

O 



U 

o 



10 



CO Eh Q) 
lO Eh rH 
•«* rtj M 
H 

o la 
o < 
o c 
u o 

Eh <y 
u ^ 

< >i 

O rH 
O O 

O >i 

O O 

D 

Eh 0) 

33 

O O 

3^ 



o 



Eh 
M 

<0 



Eh W 

O W 

< CP 

e> )H 

Eh V) 

O K 



3 



o 



(0 

< 

Q) 

a* 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



19/48 



CM U M 
.H < >t 

in H Eh 
rH 

u w 

< -H 

U K 

o a 

< tn 
o 

H C 

3:s 

< Eh 
Eh 3 

S5 

O >% 
O .H 

o o 

Eh 

< >i 

e« t^ 

O Eh 
Eh M 

< S 



in 

CO 

^ O O 



Eh 
O 
Eh 

U 
< 

o 

Eh 
O 



Ui 
>i 
O 

a 
< 



la 
> 



35 



EH 

Eh 

< 
Eh 
Eh 



3 



35 



vo < >1 
U> O •-« 
in o O 

E-< 0) 
H J3 
Eh CU 



Fh 
O 

Eh 



o c 

33 

< Eh 

O P 

Eh <U 
Eh 

Eh <0 

O < 

O Eh 
Eh W 

< S 

0\ < >i 
C5 

in O O 

Eh <0 
O 

O < 

Eh U 

< >t 
Eh Eh 

< Cn 
U < 



u 



< < 



< 

Eh 



< }H 

< Eh 
Eh <U 

< M 
Eh 

< CO 



O Eh :3 

rg Eh Q) 

VO O •J 
H 

< (0 

O < 



< 

o 

Eh 
< 



CP 
Eh 

s 



O >i 
Eh O 




Eh U 
U Q) 
Eh CO 



in Eh 

Eh 
O 

O 
O 
< 

Eh 
O 
t5 

C5 

3 

U 

3 

CJ 
O 

o 

Eh 

< 
ID 



(0 
>i 
U 

1-5 

U 
JZ 
Eh 



to 

C 
10 

< 

(0 
•H 



(0 

> 

a 
to 



-t*- Eh rH 
Eh «0 
vo O > 

Eh 

O 0) 

< CO 

< rtJ 
O rH 
O < 

Eh <U 
Eh rH 

< M 

Eh rH 
Eh rtJ 
C3 > 

O C 

O O 

o c 
u o 

< 3 
Eh 0) 
Eh h3 

CJ a 

< (0 

o < 

^ < 

\£) O O 
rH 

Eh ^ 

O <l) 

< to 

35 

t5 13 

Eh ^4 
O Q) 
Eh to 



CO < 3 
rg rtJ rH 

r-» O O 

rH 

< CT 

O < 

O P 
Eh 0) 
Eh J 

O >H 

u x: 

< tH 

33- 



< ►J 




Eh >h 

< to 




Eh C 




< rH 

Eh rtJ 
C3 > 




1701 
GCA GAT 
Ala Asp 


If) 

in 
r- 

rH 


< cr» 
o < 




CD (X 
C5 >H 
Eh Eh 




e< >H 
O <U 
Eh to 




C5 3 
Eh Q) 
Eh ►J 




Eh C 




C5 cr 
< < 




(J p 

Eh <U 
Eh ^ 





CO o a; 

r- Eh CO 

o o 

35- 

< »^ 

r-i 

Eh <0 
O > 

35 

C5 o 

< 3 

Eh <U 

Eh (TJ 
O < 

Eh W 
C5 >i 
Eh O 

fH 
O C!> 



ca 

Eh 

< 



Eh 



Eh Q) 
Eh ,J 

< la 

U rH 
C3 < 

< (0 

< >i 

< h3 

O rH 

Eh (0 
O > 

Eh 

O Q) 

< CO 

< >t 

O rH 

o o 

Eh rH 
Eh rtJ 
O > 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



20/48 



to* 



vo O 3 
r> «< •-• 
00 O O 

H U 

u x: 

< £h 
U W 

e-* (A 

< nj 
o < 

< >^ 

O 0) 
E-« W 

O =J 
Eh 0) 

H h3 
H C 

o a 
o ^^ 

< 3 
OHO) 
00 H ^ 

O < 

o o 

< CO 



Q) 



o > 
U J-i 




O u 
<n o x: 

CO i< Eh 

o 

o o 

O > 

O D 

H .J 

H <D 
Eh 

< (0 

a rH 

o •< 

EH 3 
Eh <U 
O ►J 

< <d 
O *< 

Eh >, 
O .-I 
O O 

n Eh 
VO f< CA 

CO O < 

< '-t 

Eh (d 
O > 

Eh <d 
U rH 



Eh 
O 
Eh 



to 
O 



EH a 




Eh C 



^ o >. 

o <-« 
a\ o O 

Eh 
O 

O O 

.< >i 
O ^ 

o o 

Eh U 

Eh <U 

< M 

O «J 
O rH 
O < 

rt: 3 

Eh 0) 
EH 

Eh M 
rH U £ 
a\ •< Eh 

U C 

< >-• 

o xi 

< Eh 
O C 

ri: .-I 
u o 

u u 

O Q) 
O D> 



Eh 



u 

Eh 



EH >H 
< Eh 

H 3 
Eh 0) 



CO U 0) 
<T» Eh rH 
<n < H 

rH 

»^ rH 

u o 

O CP 

< < 

u wi 

-H 
U X 

o a 
Id: (A 
u < 

O 3 

o o 

Eh C 

< Eh 

Eh (0 
O .H 
U < 



rH 



o 

Eh 



0) 

rH 
M 

3 
•J 



U 

O Q) 

< w 
a ^ 

O 0) 
Eh to 

O rH 
Eh «J 
O > 

Eh C 

:^ ^ 

u < 

< 3 
Eh 0) 
Eh >A 

< Q) 
Eh rH 

< M 



r>j Eh 
in o 
o < 
ra 

Eh 
< 



Q) 
W 

(A 
•H 



Eh U 
O 0) 
Eh to 




< 3 
Eh <U 
Eh 

U (A 

< -H 



3 



U O 

< 3 
Eh Q) 
Eh »J 

3 

Eh 0) 
Eh J 

in Eh Vh 

fN o x: 

O < Eh 
rg 

35 

U O 

< 3 
Eh Q) 
O iJ 

Eh W 

Eh O 



3 



C 
(A 



O C 

3^ 

O 3 

< rH 

o o 



CP 



< < 



< 

Eh 



3 
0) 



o 

rH 

eg 



o 



I 

< 



c 

tA 
CP 

>H 

10 



U rH 



o 

Eh 

O 
Eh 



to 

3 
0) 



Eh C 
O D4 

O »H 
Eh Eh 

O 3 
Eh <U 

Eh 

O X! 
H 

< >. 
O rH 
O O 



o 

Eh 



(A 
U 



•< (0 

U rH 

3S 

£h ^ 
O <U 
<< CO 



O 

tH 

CJ 

•< 
Eh 
< 



10 

> 

0) 



O X! 

O 3 
Eh Q) 
£h .-q 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



21/48 



o o 

VD 



CM 



35 



n O O4 

r> < w 

O < 

El E-« 

< SJ 
E-< 0) 

< <a 

o c 

o o 




o 
o 



CO E-« 



0 « 


< 3 








Eh ,J 


0 3 


Eh «J 




f * 




0 < 




< (0 


C-i rvi 




< s 


0 < 








0 .H 


< CO 


0 < 




E-i 


in I0 




0 > 


< CO 


< (0 


< >i 






0 < 




W >i 






Eh W 






0 


E-« flJ 




U ^ 




0 < 




CJ 0) 

Eh CO 

Eh cn 

i< -H 

O X 

Eh Q) 
Eh 

< M 

O P 

H 0) 



O 
U 

CM U P< 



CM 
CN 



Eh 
O 
Eh 

O 
Eh 
< 

Eh 
Eh 
< 

Eh 

3 



}H 
0) 

to 

Eh 
M 
S 



C 



O (0 

o < 

(< (/) 
o < 

o w 

U 

< >i 
Eh Eh 

(3 (/) 

O >-i 
CD < 

Eh O 
O ^1 

U (14 

(!) ^ 



Eh 

O 
O 

o 

Eh 

Eh 
O 



C 
(/I 

r-i 
< 

M 
S 

P 
Q) 
►J 



Eh C 



<N rH 

n O O 
rg 

< (0 
O ^ 



O 



C5 CP 
< < 



in 



Eh 
Eh 



01 C!) 

fM 

Eh 
U 



> 
(A 



3 



Eh 
Eh 

U ^ 

Eh >h 
U 0) 
Eh CO 



O 



a 



H 1^ 
u ^ 

O 0) 

< to 

< *H 

U Q) 
Eh to 

U >i 

o t:) 



u 

m U (U 
fM 

Eh 



U 
CO 



Eh C 

o 

Eh .-I 



Eh C 

cn 



3 



<< w 

CJ < 
Eh 

cn Eh >H 
•tr U -C 
n < Eh 

< u 

CJ 0) 
Eh CO 

< 3 
Eh 0) 
tH 

O W 

< -H 
U K 

O C 

< r-i 

O CD 

Eh (0 
O ^ 
CD < 



Eh 
< 
CD 

Eh 



a 

0) 



SUBSTITUTE SHEET 



wo 92/13103 



PCr/US92/003'>6 



22/48 



o H ^1 


CO 




^ H H 












id! lA 




o < 




tr* r>1 




• 1 

#-n 




ID O 




^ it 












U 3 




H Q) 












CD 0) 
















u o 
















r > rA 








u ac 






O 






< < 








ID C 








o o 
























< w 




r ■» 

b~* CP 




r t I 




r 1 -d! 












U ffi 




H >-i 




O G) 








< (0 








O < 




O I/) 








35 



U >i 

CO O rH 

•"TOO 
Eh V4 

H C 

3(0 

E-« 0) 
Eh 04 

■4: (0 

o < 

< 

U Q) 
Eh CO 

O 

O »-i 
< < 



in 



o 

< 
O 



a 
to 

to 
< 



Eh (0 
O K 

O ^< 



EH 

3 



(0 



u x: 

< Eh 



U 

o 

H 
Eh 
Eh 

EH 

Eh 
O 



to 

< 



> 



00 Eh 
in Eh 

U 
U 
Eh 



V4 
CO 

U 
(0 
CO 



CJ u 
O 0) 

< CO 

u o 

CJ 04 

< o 

Eh 0) 
H i-J 

O ^ 
Eh 10 
O > 

< ^4 

U 

< Eh 

H U 
O jC 

< Eh 

Eh C 

:S ^ 



3 
0) 



r-t O 

.H Eh ^ 
in Eh »J 
eg 

Eh 
< 



Eh Eh 

< O 
O U 
U Pu 

< U 
U <U 
Eh CO 



EH 

Eh 



<0 



u 
o 

Eh 
O 

O 
Eh 

u 

3 



> 
x: 

Eh 

Eh 
M 

S 

c 
to 

<: 



CM O 3 
<n < rH 

moo 

O 3 

Eh 0) 
Eh 

Eh U 

O <1) 

< CO 

< o» 

O >H 

< < 




Eh V4 

O Q) 

Eh CO 

Eh D> 

O U 

U < 



in Eh 
\D U 
in Eh 
rg 

O 



»H 
0) 

CO 

}H 

0) 

to 



•< to 

o < 

< S3 
Eh 0) 
Eh ^ 

O 

O Q) 

< to 

< >, 

O rH 

O O 



o 



)H 



< u 

U 0) 
Eh to 

< u 

CJ Q) 
Eh CO 



O 



vo < >i 

O rH 

vo O O 
rg 

CJ 
U 

Eh C 
3^ 

33 

O O 

< u 
u x: 

< Eh 

< <a 

CJ rH 

o < 

< o 

CJ ^1 
U Pu 

Eh (0 

< -H 
U K 

O U 

Eh Eh 

ON O C 

rH to 

< 

<N 

o ^ 
o o 

< 3 
Eh O 
U >A 

Eh >, 
O rH 
O O 

Eh <1) 
Eh rH 
<< M 

O rH 

o o 

CJ CP 

O >H 

U < 

35 

o o 

o u 

< < 



o 
o 



u 

O 

o 



to 
rtJ 



0) 



Eh 
Eh 

< M 

O C 

< rH 

u o 

O rH 
O < 

< m 

CJ rH 
O < 

Eh >h 

CJ x: 

< Eh 
O ^ 

o x: 

< Eh 

U U 
U <U 
Eh to 

n O 0) 
t^ Eh rH 
vo i< M 

O C 

< ^ 

o o 

O 3 
Eh Q) 
Eh ^ 

Eh >, 

O rH 

o o 

O U 

CJ rt: 



o 

< 
O 

CJ 

Eh 
O 



to 

CD 

to 

U 

to 

>H 

x: 

Eh 



e-.mcTITIITE SHEET 



wo 92/13103 



PCT/US92/00376 



23/48 



in o 

r» o o 
rsj 

Eh 
U 
H 



CO CJ 



1^ 



Eh 
O 
< 

< 

< 

< 

o o 
o c 
u o 

Eh >4 
Eh W 
O U 

u x; 

< Eh 

Eh M 

< -H 
O K 

Eh Q) 
Eh .-I 
•< l-l 

U 10 

Eh W 

O >-i 
Eh <0 
O > 

35 

o o 

O Eh 
Eh W 
< S 

U r-l 

O > 



o 

CD 

rg 



CO 
CP 



< < 

Eh 3 
Eh 0) 

< la 

CJ rH 

o < 

Eh C 

< < 

Eh a 

< ta 
e> #< 

00 u x: 

r- << E^ 

Eh nJ 
O > 

£h w 

Eh a 

Eh U) 

< -H 
U ffi 

Eh 0) 

Eh 

32 

(J) O 



Eh 

o 
o 
< 

Eh 
CJ 
Eh 



x: 

E^ 

Eh 

0) 
CO 



00 



U O 
£h CO 

Eh C 

3^ 
35 

O >-i 
O 0) 
E^ CO 



C!) 

3 



(A 



Eh ^^ 

o x: 

< Eh 

U Q) 
E^ JC 
Eh (X 

Eh C 

3^ 

< >i 

Eh H 



in Eh 
m o 

CO < 
CM 

U 



3 



Eh 

c 

CO 

< 



< u 

Eh CO 
Eh W 
O 33 

< U 

u x: 



Eh 

O 

O 
O 
O 

Eh 
O 
t!) 



to 



C_> Q) 
Eh CO 



V£> £h 



cy> 

CM 



c 
in 



O Eh 



O 0) 
Eh CO 

Eh U 
U 0) 
Eh CO 

< CP 
C5 »H 

< < 

13 to 

35- 

O U 

< >, 
Eh Eh 



3 




Eh 
< 

Eh 
U 
Eh 

Eh 
C5 
Eh 

< 
U 
< 

o 
o 
< 



3 



Eh 
M 
S 

<U 
CO 

CO 
>i 
CJ 

U 

x: 

Eh 
CP 
< 

c 

< 



o 

r- Eh 
o> < S 

35 

a o 

Eh >, 
ID CD 



Eh 
Eh 

U 0) 
rt: CO 



Eh 



to 



C5 < 



SUBSTITUTE SHErr 



WO92/13I03 



PCT/US92/00376 



24/48 



'«^' E-« 
rg < 
o 

9-1 
O 
< 



U 
>i 

0) 



O >i 
H 0) 



< 



a 
en 
< 

a 



a» *eXj fH 

<n O O 

fM 

O 0) 

< >t 
E-« 

U U 
U 0) 



Eh 

< 

O 
U 



0) 

M 



O O 




CO 

o 
n 



Eh 
< 
O 

Eh 
< 



a, 
in 

Eh 
W 



E-i W 



(0 



< to 
o < 



o 
u 



< >. 

Eh Eh 

u o 

Eh >, 
O «-! 
O O 




O <U 
•< CO 

Eh U 

< >t 

EH EH 

Eh C 

%% 

< 0) 
Eh rH 



If) 
O 



o 
u 



rH U 0^ 



V4 

Eh 



Eh a 

< (0 

o < 

< p 

83 



3 



o o 

EH 

< (A 

o < 

Eh C 

<4: (A 



00 O 

o 



U 
< 



Eh E^ 



< 



V4 



3 

< u 

Eh W 



c 

0) 

c 



Eh 
U 



O 
V4 



0> Eh >H 
tn U 0) 
rH < W 

35 

U O 

CD U 

< < 

< >i 

O rH 
O O 

EH U 
U <1) 
Eh CO 

O C 

O D 
Eh 0) 
Eh ►J 

C5 C 

#< rH 



< 



rH 



O CJ 

Eh a 
C5 < 

33 

C5 C5 

■< 0) 
Eh rH 

< M 

< o 

Eh rH 
i< M 

U W 




C3 O 



U O 



C7^ Eh 
(M U 



vo 

fM 



o 
o 



:3 

0) 

ij 

(A 
•H 

to 
>1 
►J 



Eh a 

< tn 

ID < 



Eh 
< 
O 

Eh 
O 
< 

O 
C3 



a 

(A 

< 
u 

Eh 

U( 
0) 
CO 



O 3 
rH 

Eh V4 
O A 
< Eh 



Eh 
< 
Eh 

H 
CD 



U 



(0 
> 



H O 
O ^H 



s 

Eh 

Eh 
O 
< 

< 

< 

Eh 
O 
< 



6-1 
U 
£h 
M 

u 

0) 
CO 



•4; m. 



cnncTiTIITP fiHFFT 



wo 92/13103 



PCT/US92/00376 



25/48 



1^ 



CO U 

r) U 
n 

< 
O 



1-1 



CP 

U 

< 

U 

tn 



a 
< 



u 
>1 



H Eh 
< O 
U (U 



EH 

U 
Eh 

Eh 
Eh 
U 

Eh 
O 
Eh 



U 

w 
l-( 

> 

>1 
(J 



35 



(5 C 

o O O 
n 

O C 

u o 



Eh 

Eh 
< 
U 



0) 

x: 
tn 

•H 

5C 



rt: o 

o 

u 

35 

O O 

CJ 0) 
Eh 45 
Eh a. 

O W 

33- 



fM ^ C 
O < i-H 
^ U O 
CO 

Eh C 

33 



0) 



H 
Eh 

< M 

< >i 
O O 

Eh tn 

< -H 
O K 

Eh C 

Eh U 
U 0) 
Eh W 

Eh >, 

O -H 
Eh <0 
O > 



in < 

n O 
n 

Eh 



3 

< 

35 



< 

c 
tn 
< 

u 

XX 

tH 



o 

Eh 



U 
CO 



Eh >, 

O O 

H C 

«, "I. 

CJ <c 

CJ rH 

Ci) < 

< >. 

o o 



3456 
CCT ACC 
Pro Thr 


3510 
AGA CCA 
Arg Pro 


< ^ 


O 3 
CD C!) 


Eh a 

C5 < 


35 

O CD 


Eh a 

< tn 
o < 


35 

O CD 


35 

O O 


o o 


Eh J-i 
< >> 
H Eh 


H tn 


CJ a 
o < 


CD C 

Kt! rH 

O CD 


Eh 

•< (0 
C5 < 




3429 
CAA GAA 
Gin Glu 


n :3 

CO <; rH 

^ o o 

35 

a CD 


Eh W 
C5 >i 
H CJ 


Eh U 
O 0) 
Eh Ui 


Eh <U 

Eh 


U U 


Eh >^ 
CJ 0) 
Eh CO 


EH IT> 
CD U 
O < 


O C 

< rH 
CJ CJ 


35 

CD CD 


O >-i 
C!) G) 
< W 


Eh U 
CD Q) 

< in 


< ^ 
Eh (0 

o > 


Eh >-i 
H Eh 


Eh C 


Eh C 



3564 
CCT ATT 
Pro lie 


3618 


. o c 

< rH 

CJ C5 




Eh a 

^ WJ 

CD < 




O ^ 
Eh 10 
CD > 




Eh tn 
CJ K 




^ CP 
CD U 
O < 




(0 

< 




CD 3 
< -^ 
CD CD 






O ^1 
CD Q) 
< CO 



Eh 
< 



Eh Eh 
H C 

3S 

CJ £ 
< Eh 



OS 

in 
n 



SUBSTITUTE SHFrr 



wo 92/13103 



PCr/US92/0037^ -5' 



26/48 



< u 

C_) Q) 
E-i W 



E-i 
O 



O 



O 04 

<D 



< CO 



U 
E-t 



< U) 



u 

Eh 
U 



U 



E-I H 
Eh (A 




#< to 

CJ U 
O Q) 



3 



in 

vo O O 
m 

O O 

EH >^ 

U 0) 

E-I W 



o 

EH 



Eh U 
O Q) 

< Ui 

O W 

CJ Q) 
Eh W 

U Q) 

EH x: 

Eh (l4 

< U 
U Q) 
Eh CO 



vo U 13 
rj Eh O 



o c 
u o 

Eh C 

tn 

<< < 
o c 
u o 

< < 



O 
U 
o < 

Eh C 

O 0) 
Eh CO 



0> < 
VD Eh 



(O 



o 

Eh 

O 
O 
< 



u 

Eh 

U 
O 
CO 

Eh 

c 



O 3 

CD O 

H >H 
O 0) 

< CO 

O U 
O CD 

< CO 



O U 

00 O 
r- EH 
n 

Eh 
U 



(A 
U 
U 

x: 

Eh 



O «0 
U rH 

O < 

Eh (0 

O < 

O W 

O O 

Eh O 
O U 
O CU 

O C 

u o 

Eh >, 

O rH 

O O 



Eh 



CO 



EH U 

ks: CO 

C!) C 

#< rH 

a o 



< 
o 
o 



Eh 

U (U 
Eh CO 

CJ 0) 
< CO 



o 
u 



u ex. 

Eh 01 

< -H 

U E 



^ Eh 
n U 
CO << 

n 



u 

A 
Eh 

a 
< 



35 

o o 

Eh <d 
O > 



Eh 
O 
Eh 

S 

£h 

Eh 
O 
< 



(0 

o 

Eh 
Eh 



o c. 

O CJ 

•< 0) 
Eh .H 
M 

o u x: 

CO < Eh 

35 
35 

U O 

u c 

Eh Q) 

Kt; M 

Eh U 
O <U 
Eh CO 

E^ >H 
- O (1) 
Eh CO 




00 Eh Gi 

00 < (A 
CO O 

n 

O O 

Eh <0 

o 

< 

O <U 

Eh to 

< U 

U <U 

Eh CO 



C5 



:3 



Eh U 

O Q) 

Eh CO 

O <1> 

Eh CO 

< 3 

Eh O 

Eh a 



VO O 
CO Eh 

O 
< 

Eh 
O 



0) 
CO 

0) 

to 
tn 

_ >i 



< u 
o a» 

£h CO 

Eh 0) 
Eh ^ 
Eh 

Eh tn 
O >i 
Eh a 

< Q) 
Eh rH 

< O 

a >-i 



f;im<;TmrTF curr-r 



wo 92/13103 



PCr/US»2/00376 



27/48 



3 



CM 

0\ o o 
n 

u x: 

< Eh 
H C 

O -J 
O < 

& 0) 

< to 
o < 

< (d 

o o 
in o c 

rH < .-I 

o\ O O 
n 

< u 

< Eh 

o >^ 

< Eh 
O C 

u o 

Eh C 

Eh W 
O >i 
Eh CJ 

< >i 
CD ^ 
O O 

< Q) 
Eh ^ 

< M 

S3 

O (J 



cn < 

O 
£h 
O 



)h 
d) 



ID 

> 



O £h >, 
in O 



o 



Eh O 
CJ U 
O D< 

Eh ft 
<< (A 
C5 < 

33 

CD CD 

Eh m 
U ^ 
O < 

< U 

o <y 

Eh CO 

CD tP 
CD V4 

< < 



0) 



O C 
O CD 
O P 

o u 
< < 

CJ v< 
CD 0) 
<< CO 



^ < >1 

O CD rH 
W CD CD 

< u 

CJ Q) 
Eh to 

Eh O 

CJ 

Eh fl) 
Eh x: 
Eh O* 

#^ tH 

CD CD 



O ^1 
U 0) 
Eh to 


Eh ^ 
Eh nj 
CD > 


< to 

< tJ 


Eh 10 

CJ r-i 

CD < 


< Eh 


< (0 


< o> 

CD U 

< < 


CJ (0 
•< -H 
O W 


4023 
CAC CCT 
His Pro 


CD CP 
tD ij 
O < < 

CJ (0 
O .H 
CD < 


o c 
< ^ 

O CD 


< »H 

O 0) 
Eh CO 


< u 
u <u 

Eh to 


33 

O CD 


CD <-« 
Eh flj 
CD > 


< U 
. CJ 0) 
Eh to 


< <0 
CD < 


Eh U 
O 0) 
Eh to 


< O 
CJ u 


< :3 

Eh 0) 
Eh ►J 


EH rH 

Eh (0 
CD > 


Eh M 
CD Q) 

< to 


33 

O CD 


Eh U 
O Q) 
£h to 



00 CJ to 


oj £h 


m < 


fH Eh 


rH o a; 


rvi CJ 












o 






Eh O 


Eh 


O 


CD 






< o 


O 




Eh 




r f\ 

kD 


Eh 


fH 


CD (U 


O 




r- . 
CH 


< to 


Eh 




O 




< 


CJ o 


Eh 




CD 




Eh 




< 


o x: 


CD 


< Eh 


< 


CD C 


O 


< -H 


CD 


U CD 


<< 




o o 

Eh 

Eh CO 




in 

00 



EH 

£h 
£h 

O 
£h 
< 

CJ 
Eh 
O 

CJ 

o 

CJ 
CJ 

< 

CD 
< 
CD 

CD 
< 
U 

Eh 
CD 

Eh 
< 
Eh 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/0037i 



28/48 



3 


vo H V4 


o ici C 


O 




CVJ < ^ 




ra #< w 


fO O O 










O W 






O >t 


o ^ 


to 


Eh O 


C5 C!> 








0) 


O M 




0) 




O CU 












C5 0) 


> 


o o 


< CO 




H U 




0) 


O 0) 


< t/) 


CO 


< W 


C5 < 


• . 


\D C 




x: 


< <-l 


0 u 




o o 




f n 
UJ 


r 1 1 


r m w 
CH P 








V-/ 








CJ Li 


Fh CI. 




a <u 


< CO 


< 




0 -< 


Li 


U Vi 




0) 


O 0) 


C!) Q) 


CO 


< CO 


< CO 


i\\ 


v*i V\J 


r ) n 


jC 


n U rH 


0\U u 




(N C3 i< 


CM 0 04 








H 








H rH 


tD <D 


s 


< M 


CO 




O W 


(u 


o 


U 0) 




•J 


E-i CO 


S M 


o 








O U 






a < 










x: 


CJ 0) 


0 .-I 


H 


icC CO 


0 0 


<^ 


to ^ 


F-l U 




< <H 


0 <u 


O 


O C!) 




c" 


H 0) 




f— 1 










0 > 




H V4 


0 E-« 




CD <D 


Eh U 


> 


< CO 






EH a 




>1 


< w 






C5 < 


0 0 



r> < ^ 
o U O 

H (0 
O ^ 
O < 

o x: 

< Eh 

35 

U O 
Eh O 

a U 
CJ cu 

eg 

U O4 

< o 
o cu 

< o 

U Oi 

Eh O 
O U 
U O4 



< O 

O Cm 



o 

Eh 
< 

u 



Eh 
U 
S 

>H 
fH 



00 Eh 



U 
0) 



00 O 



O 3 

o o 

< CP 

< 



ID U 



o to 



3=^ 



o 
u 

O O4 



£h 
Eh 
U 



3 
•J 



Eh .-h 
Eh flJ 
O > 

O C 

< rH 

u o 

U rH 

Eh nj 
O > 

O Cn 

< < 
O C 

i< rH 

u o 

Eh rH 
Eh <0 
O > 

< (0 

u ^ 
o < 

in Eh n) 
IfJ U <H 
O < 

Eh C 

< < 

Eh «0 
O > 

< <a 

O rH 

o 

Eh Id 
O .H 
O < 

35 

o w 



Eh O 
U 04 

o ^ 



SUBSTITUTE SHEET 



^^^WO 92/13103 



PCT/US92/00376 



29/48 



ft 



vo 6-1 
n O 



u 

0) 
CO 



o 



0^ U 
O U 

in o 



to 



0) 



El 0) 

< P 

E-« 0) 
El 



U 
< 

U 

El 
U 

E-« 
< 
O 



Eh 

a 

CO 

< 



in o 



(0 



Eh <U 


(A 




< >i 


Eh 04 


< »J 


< >i 


CD C 






o o 


o o 


Eh a 


< 0) 


< to 


£h .H 


o < 


< M 


< o 


Eh <a 


u u 


Eh 




Eh Pu 


Eh V< 


•< O 


o x: 


O J-i 


< Eh 


O 


Eh Jh 


O 3 


O 0) 


< ^ 


< CO 


O O 


as 


£h a 




< (0 




o »< 






o Si 




< H 


S5 



in < 
O 
O 



V4 



Eh (0 

o < 

Eh U 
O 0) 
i< CO 

o h:i 

O >-( 
O 0) 

< CO 

O ^ 
U OJ 
Eh CO 

< ^1 
Eh CO 



Eh 



to 



^ U JC 
VD < Eh 

as 



Ft c 
to 



O C5 



E^ C 
Cfl 
< 



<< CO 

o < 

Eh C 

O O 

r» Eh i-H 
»H Eh <a 

\D O > 

< o 

CJ di 
Eh O 
U 

O Eh 
Eh W 

<: s 

< o 

Eh .H 

< M 

< D> 

< < 

Eh <U 
£h .J 

CJ CD 

C3 -H 
Eh (0 
C3 > 



CO W 




o\ ^ >^ 


in Ki; rH 


VO ic^ h1 


CD CD 








Eh <U 


< «-« 


Eh r-i 


CD C3 


< M 


< (0 


H a 


U iH 


< CD 


C3 << 


CD < 


O 3 


H £X 




< CO 


CD O 


CD < 


CO 


Eh a 




< CO 




CD < 


CD 3 


Eh a 


«< ^ 


•< CO 


O CD 


CD < 


35 


< Jh 




O 0) 


U CD 


Eh CO 



o c 

CO 

< 

CD CD 

^ Eh C 
r>- «C CO 
^ < «< 

< U 
O 0) 
Eh CO 



o o 

Eh CO 

3S 

CD CD 



Eh a 

< CO 
CD < 

Eh a 

< CO 
O < 



in 



< 
Eh 
Eh 

< 
Eh 
O 



(1) 
>^ 

(1) 
•J 



< CO 
CD < 

CD CO 

3^ 

CD O 

Eh ^ 
U <D 
Eh CO 

Eh a 

< V) 

o < 

Eh 0) 
Eh ^ 

< M 

Eh U 
U Xi 

< H 




Eh CO 



fH CO 

CD CO 

35- 

o x: 

< Eh 

< O 
O 

CD EH 
Eh W 

< S 

cn O <o 



O 
CD 

Eh 
O 
Eh 



U 
CO 



Eh a> 
Eh ^ 

< M 

Eh Q) 
Eh ,h 

< H 

Eh CO 
CD >i 
Eh CJ 

CD CD 

3S 

CD CD 

< 3 
Eh <u 
O ^ 

< 0) 
Eh ^ 

< IH 



SUBSTITUTE SHEET 



wo 92/13103 



30/48 




O CP 

o u 

< < 

< «J 



C3 
O 
Eh 



> 

o 
u 



o 



E-i O 

U 04 




o 
o 

< 



o 




iH #15 «H 

tn O O 

o 
a 



o 
u 



O 04 

o o 

E-« 0) 
H .J 

O CP 

< < 



3 



c 
< 



55 

o o 
<< u 

Eh W 
< O 

U 04 




E-« O 

U 04 

O 3 

Eh 0) 

O .J 

O C 

o o 

EH U 

ID OJ 



CO *i 3 

VD < fH 
O O 

Eh 

Eh (d 
O > 

Eh CO 
O >t 
B O 

Eh U 

< >i 
Eh Eh 

O r-H 
Eh (d 
O > 

O CP 
O U 
O < 

o 

o u 

CJ 04 

O Eh 

< s 
EH a 
o < 



rH Eh 
"T < 



(0 



O rH 
O O 

o o 
u u 

U 04 

b 

U J3 
Eh 

Eh 0) 
Eh £ 
Eh 04 

Eh U 
O 0) 

!<< CO 

Eh rH 
Eh (d 
O > 

Eh CO 

o ac 



3 



CM 

o O O 
in 

u <u 

Eh rH 

o x: 

< tH 

< 3 

Eh <U 
U iJ 



Eh 
< 



V] 



Eh 

O 0) 

< to 

< 73 
Eh <U 
U ►J 

Eh Vh 
O 0) 
Eh CO 

< Vl 
CJ Xi 

< Eh 



in Eh 
0> U 

< 



u 

Eh 



O ^4 

U 0) 

Eh CO 

Eh Q) 

u c 

< o 



Eh 
U 



O 

V4 



U 04 



< 



u 

Eh 



C3 >i 
CJ O 



PCT/US92/00371& 



5076 
CAG TCA 
Gin Ser 


5130 

GAT GAG 
Asp Glu 


< (d 

U rH 
O < 


< u 

O XI 

< Eh 


< >i 
O rH 

o o 


Eh 

O 0) 
< CO 


C5 rH 

o o 


< CP 

< < 


CP 
<: < 


O >, 
O -^ 
O O 


Eh rH 
Eh <d 
C5 > 


33 

O O 


< >i 

CD rH 

O O 


< ^1 

< Eh 


33 

C5 C3 


Eh O 
O U 

CJ 04 


5049 
GCT GGA 
Ala Gly 


fn Eh Q) 
O Eh rH 
rH < M 

in 

O U 

o x: 

Eh 


TTA GCT 
Leu Ala 


Eh a 

< CO 

o < 

< CP 

O »H 

CJ < 


AAT GAG 
Asn Glu 


•4; CO 

0 0 


< o 
u u 

U 04 


Eh <U 

Eh x: 

Eh O4 


Eh 0 
U ^1 

U 04 


33 


U U 
U 0) 
Eh CO 


Eh >i 

0 rH 

0 0 



SUBSTITUTE SHEET 



>W0 92/13103 



PCT/US92/00376 



31/48 



5184 
AAT AAA 
Asn Lys 


5238 
AAA GGG 
Lys Gly 


< (0 

o < 


u o 
o u 

U P. 


Eh a 
o < 


O Eh 
Eh Cd 
< S 


iD P 
Eh Q) 
Eh h3 


Eh «J 
O fH 
O < 


O O 


EH »-l 

U <D 
Eh W 


Eh O 
U U 
U 


Eh C 


< 0) 
Eh rH 
M 


Eh 0) 
Eh .-H 
M 


o u 
o jc 

< Eh 


U 01 
O >t 
Eh U 



in 
in 



< 
Eh 
U 

Eh 
U 
Eh 



(0 
> 



o a 

Eh to 



«H #<3j i-H 
fNJ O O 

in 

o < 

Eh 3 
Eh <U 

< M 
EH ft 

< 01 
O < 

Eh >, 
O 

O O 
—I 

o o 
o o 
o < 



(r> O rH 
CJ O < 

in 

35 

o o 
o c 
u o 

Eh rd 
O > 

O C 



o 
o 
< 

Eh 

3 



a 

01 

< 

Eh 
M 
S 

0) 

rH 
M 

01 

•J 




5265 
GTG AAA 
Val Lys 


5319 


Eh 
O 

U < 




O 0) 
Eh 

Eh 0^ 




Eh O 
U CU 




O 01 

33- 




U 01 

o a: 




Eh >-i 

o <u 
< tn 




fa; 01 
< >i 
<^ 





O 0) 

33- 



10 



Eh 

U <U 
Eh CO 

O U 
U 0) 
Eh CO 

O <0 
O .H 
O < 

H 

O <U 
Eh CO 



O 
O 

tn 



CO 



< 
Eh 
ID 

Eh 
CD 
U 



> 



< U 

u x: 

< Eh 
ID CP 

< < 
Eh ^ 

35 

O O 
Eh U 

< Eh 



Eh C 
0) 
< 



3 



35 

a tD 



< 
o 

n CJ fl< 
in 

< 
Eh 



O 



0) 



Eh O 
U ^ 
U CU 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 ^' 



32/46 



in 
in 



< 

< 
u 



Pi4 

(A 
< 



U 0) 



(0 

> 



O 3 

U r-t 




E-« W 



CO 

o < 
in o 
in 



3 



(A 

c 



U 0) 




in 



r .J 


VLP M 


r \ t • 


U; 


O P4 


VD Eh W 




in 








^ VJ 




O < 




H C 


^ to 






•-^ 




< CP 








C) id? 




u 


O •H 




U «. 


v/J 




El Q) 




S J3 


E-i 






E-< W 


O 0) 


O >i 


< to 


b< O 




U U 


o ^ 


< >. 


o o 


Eh 


< IT» 


E-« O 






*<: *< 


U CU 



in o 

in O 
in 

< 
< 



(0 

> 
< 



•< to 
o < 

35 

Eh C 

< O 
O >-i 
O Ok 

cj a 
o tn 



<n Eh >^ 
CO O 
in < Eh 
in 

< >i 
O tJ 

o o 

Eh 0) 
Eh rH 

< M 

B2 



o 
o 

Eh 



U 

x: 
>1 

Eh 



Eh tn 

O X 

£h W 

CJ as 



O Eh flj 

U rH 

O O 
in 

O (0 

35 

< < 

O <1) 
Eh to 



0 

•J 



< tn 

ID < 
Eh 

Eh «d 
ti) > 

£h a 
oa: to 
o < 



tn 

Eh 

o 

Eh 
< 
O 

Eh 
Eh 
Eh 



a 
to 

< 

to 
< 

a 

(0 

<: 



Eh a 

tci: to 

o *< 

< 3 
Eh 0) 
O h:j 

Eh U 

U Q) 

Eh to 

Eh U 

< to 



CD 
Eh 



:3 

0) 



<^ o to 

CM -H 

r- a K 
in 

a 
(J 
< 



u 
to 



u jC 

< Eh 

Eh rH 
Eh 10 
O > 

to 
>i 

< h4 

Eh 10 
U rH 
t5 rt: 

O 0 
< 

O CD 

< U 
U Q) 
Eh to 

35 

O tj 

o (0 
<^ >i 
vo < »J 
in 

35 

O O 
•(^ to 

10 

CJ rH 

o 

o to 

3& 



U 



< < 



< 
Eh 



3 
0} 



Eh J 



Eh 

35 



SUBSTITUTE SHEET 



' >WiO 92/13103 



PCT/US92/00376 



33/48 



CD 



5778 
GCA AAG 
Ala Lys 


5832 
ACT TTT 
Thr Phe 


5886 
AAG TTA 
Lys Leu 


5940 
TCT CTG 
Ser Leu 


5994 
CCT ATC 
Pro He 


0^ M 


o ^< 

U Q) 
Eh W 


o o 


U Q) 
Eh t/3 


% *H 

o o 


U r-i 

o < 


35 

U O 


Eh (X 
< W 
O < 


Eh C 


Eh C 


O O 


>1 


Eh M 
O J2 
< Eh 


Eh w 
U E 


35 

O O 


u x: 

< Eh 


o c 
u o 


< 10 
O < 


t^ U 
O Q) 
Eh W 


< CO 

< >1 


O W 

5& 


H 3 
Eh <J) 
U i-J 


< fl3 

CJ rH 

O < 


Eh <D 
Eh 

Eh ft, 


Eh C 


Eh C 


< <U 
Eh ^ 

< M 


O >^ 
O O 


O CO 
O >i 
Eh O 


u c 

3S 


Eh «J 

O rH 
O < 


U 0 

o u 


< tJ^ 

< < 


Eh rH 
Eh «0 
O > 


o c 

35 



in o 0) 
H cn 

in 

35 
35 

CJ o 

u c 

35 

U >H 
O 0) 
Eh CO 

u x: 

< Eh 

< :3 

Eh 0) 

H 

< >H 

o xi 

< Eh 




tD >H 
CJ < 

Eh C 
3^ 

•< a) 

Eh ^ 

< O 
O >H 

CJ cu 
o c 

O C3 




n «< 
tH O 
ON O ft^ 
m 

Eh 
O 
< 



O 

V4 



Eh 



CJ Q) 

Eh cn 

O 0) 

Eh CO 

CD C 

< -H 

CJ C3 

CJ O 
O 

CJ ft* 



Eh C 

35 

C!) O 

Eh Q) 
Eh ^ 

< M 

Eh ro 

CJ rH 

C!) < 

Eh x: 

Eh ft* 

Eh C 
CO 
< 

o c 

< r-i 

O CJ 



3 



0\ O O 
in 

35 

o a 

< CO 
CD < 

Eh 0) 
Eh iH 

< M 

CJ a 

< CO 
CJ < 

Eh Vi 
CJ QJ 

< CO 



U 
Eh 
U 



3 
.J 



Eh M 
CJ (U 
Eh CO 

Eh U 
O 0) 
< CO 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/00376 



34/48 



00 

o 



(J) 



o 
vo 



o c 
< ^ 



< 
o 



V4 

10 



E-« O 
O U 

O O 



o 
o 



u 




rg 
O 

tH 
VO 



u 
< 



< rtJ 

u <• 


o o 

Eh W 


35 

U O 


u d) 

Eh Xi 
Eh 04 


£h O 

a 04 


Eh (fl 
O >i 
Eh O 


< M 

35 


H rH 

o > 


Eh U 
O 0) 
W 


rt; o 
u ^^ 
O 


< O 
U 


o u 

< Eh 


35 

o o 


Eh a 

< to 
o < 


< >1 

O rH 


35 



in Eh rH 
Eh «J 
o O > 

vo 

Eh cn 
o ac 

Eh <U 
Eh x: 
Eh 0< 

< U 

u a 

Eh W 



Eh 10 



^ < 0 

in #<S; rH 

rH O O 
VO 

o c 

< rH 

u o 

Eh 0) 
Eh 

O 3 
Eh 0) 
U iJ 

U CU 

< 10 

o < 
Eh a 

< CO 

o < 

35 

o o 

EH U 
U fl) 
Eh CO 

u a 

< CO 
O < 

0\ Eh 0) 
CM Eh rH 
iH < M 
vo 



Eh 
O 



o 

CO 



Eh 3 
Eh 0) 
U (J 

EH 

CJ 0) 
Eh CO 

Eh i4 
O 0) 

< CO 

O D 
Eh 0) 
O 1-3 

Eh 

O 0) 
Eh CO 

EH 

O Q) 

< CO 

CJ c 

3JS 



o 
eg 

VO 



GGT GAT 
Gly Asp 


6264 
CTG ACA 
Leu Thr 


6318 
CCT GAT 
Pro Asp 


O CO 

«• t-H 


Eh a 
< CO 


CJ u 
U 0) 
Eh CO 


U 3 
Eh <U 
U iJ 


35 

C5 O 


< 3 
Eh 0) 
U 


CJ V< 


Eh >, 

ID rH 

O C3 


Eh >, 
C5 rH 
O O 


CJ <u 
Eh CO 




Eh (0 

o ac 


Eh O 

O Pu 


< o 

Eh r-t 

< M 


3 

o o 


C!) CO 

35 


O >t 

o ^ 

O C5 


< u 

O 0) 
Eh CO 


ie£ (0 
•< >. 


Eh >, 

t3 rH 

o c:) 


Eh a 

< (0 




< o 
u u 



o 

Eh 

o 



Eh 
M 



u »^ 
O <U 
Eh CO 

U U 
CD Q) 

< CO 

< 0) 
EH rH 

< M 

EH to 
O 

H U 



C5 Eh 
M Eh W 
(N < S 
vo 

Eh C 

o Jh 
< < 

o o 
o u 

O 

Eh V4 
•< CO 




Eh C 

CO 



CJ 



o 
u 

(M u a< 



vo 



C3 V< 
< < 



O < 



Eh 
Eh 



3 



SUBSTITUTE SHEET 



wo 92/13103 



PCr/US92/00376 



35/48 




GTA AGT 
Val Ser 


6426 
TCT GAT 
Ser Asp 


6480 
TTT CAT 
Phe His 


6534 
CGA ATT 
Arg He 


<; M 


O 0) 
H CO 


oj: o 


< o 
u cu 


CJ M 

U 0) 
H CO 


Eh <0 
u ^ 


< >-i 

O 0) 
Eh CO 


o o 


£h C 
§< 




< >i 
O O 


< >1 

< ►J 


< m 
o < 


< CP 

< < 


Eh 0) 
O 


Eh C 

5.2 


£-« >i 

o o 


o <u 

Eh to 


Eh 

O Q) 
Eh CO 


Eh U 

o a> 

< CO 


o o 


< p 

Eh 0) 
Eh 


O Q) 
Eh »-^ 
< M 


< Eh 


o c 
u o 


Eh U 
Eh U 


< >i 
O O 


E^ <1) 
Eh i:: 
Eh 0* 



H ft 
«< tn 
o <: 

Eh 0) 
Eh s: 
Eh 04 

Eh C 

O O 



m O rH 

n O << 

o 

O < 

Eh fl3 
O iH 
O < 

Eh «3 

O rH 

O < 

35 

o o 

Eh W 

u a: 



Eh 



3 
0) 



Eh ^ 
< CO 




O 3 
E-« Q) 
O ►J 

o yi 

O 0) 
Eh Ul 

Eh 3 
Eh Q) 
O 

U 0) 
Eh -t 
«< M 

U >-i 
U Q) 
El CO 

E^ a 
< yj 
o < 

< 

U (1) 
Eh CO 




EH CU 

o < 

H o 

u u 



< 

Eh 
Eh 
U 



£h 
(1) 



00 Eh J-i 
00 O 0) 
in << CO 

35 



U 0. 




rs3 

VD 



in 

«H 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/0037i " ' 



36/48 



AAA GTT 
Lys Val 


6696 
AAC ATG 
Asn MET 


< >i 

O O 


< (D 
O .H 
O < 


H U 

u x: 

< Eh 


35 

O O 


Eh 0) 
Eh rH 


H 3 
Eh Q) 
U h3 




U O 
U V< 
U O* 


Eh U 
O 0) 
< W 


O C 




O Eh 
Eh U 

< s: 
<^ < c 

t£> < rH 

vo U O 

o ^ 
o o 

U <U 
£h W 

Eh 0) 
H ^ 

< M 

35 

O O 

U Q> 
Eh to 



Eh 

3 



Eh U 
U Q) 



< 



V4 



O U 

tH 



CM 

vo 



3 



W 
C 



u *s: 

Eh ^ 
Eh «0 
O > 

< >i 
O -^ 
O O 

*a: O 

U O4 

Eh 0) 
Eh r-* 

< M 

Eh cn 

< -H 

o ac 

Eh 0) 



Eh 
Eh 



O CP 

<: <: 
o o 

O >H 

o < 

Eh 

U fl) 
Eh W 

U 0) 
Eh ^ 

< M 

< Vh 
U 0) 
Eh M 

Eh O 
U U 

U 04 



O U 
CO O 
VO 



o 

u 



Eh U 
< Eh 



00 O 
in 
00 
VO 

U 
U 
O 



►J 



04 Eh >, 
H O "H 
0% O O 

VO 

<: 0) 

Eh rH 
< M 

35 



r ^ rA 


< 




r 1 Lj 


^ *^ 


CJ 


U 


r 1 tfii 


a J 

1— f 


< << 


wj 




Eh 


0 






U 










04 


< Eh 


f 1 rk 


Eh 








0 








Eh 


CO 




id! 0 


Eh 






f 1 Li 




x; 




U ft 




Eh 


< < 




0 


U 


f 1 /rc 
V-/ ivJ 






XX 


r 1 




< Eh 


0 < 


^ rA 






r . 

fcH r— 1 




a ^ 


Cj /rc 






< 


0 > 




rH < 




\r\ Z—t C\ 
u 1 V^ 


r*^ id^ 


CO 0 x: 


m f J Lj 


r- 2 iJ 


CO Eh 


CO U 


vo 


vo 




vo 


M 


3 


c 


r 1 Lj 
VJ >H 


r 1 








H w 


U 




< CO 




Eh 




id* '1 








0 > 






EH 0 


3 




35 






f-H 




U Q* 




C5 




Eh 


Eh 




< u 


0 0) 




0) 


U Q) 


< CO 


< 


CO 


Eh CO 




Eh 


0 


>t tn 




U 


U 


< >i 


< Eh 


U 


O4 




Eh M 


CJ 


>^ 


C5 


0 0 


0 


0) 


Eh «a 


< W 




CO 


0 > 


< U 




to 


Eh 


u 0 




>i 


0 0) 


Eh M 




•-5 


Eh CO 


U >-i 




V4 


< 0 


U 0) 


u 


(U 


U U 


Eh CO 


Eh CO 





SUBSTITUTE SHPFT 



>. wo 92/13103 



PCT/US92/00376 



37/48 



so < 

< 
u 

El 



U 
< 

U 
0) 
CO 



Eh O 
U U 
O CU 

< Eh 



O Eh 0) 

eg E-« 

O < M 

O Q> 

Eh CO 



U 

3 



C 
(A 



< CP 



o 



< < 



CO «< 
CN U 



o 

>4 



3 
4) 



35 

O O 
Eh U 



O 0) 
Eh CO 


o o 


O 0) 
Eh CO 


Eh a 
< </) 


Eh O 
O ^-1 

O 04 


< :3 

Eh 0) 
Eh ^ 


< 

< < 


Eh 

U 0) 
Eh CO 


< m 

< >t 


H U 
U 0) 
Eh CO 


O C 

o o 


o c 
3 < 


< >i 

o o 


< <u 

E-i 

< M 


Eh O 
O U 
O (i< 


6939 
AGA TCA 
Arg Ser 


n Eh O 
U 

VO 

< 

< < 


r- Eh O 
O Jh 
O CJ 

Eh U 

< CO 


Eh 

O 0) 
Eh CO 


Eh >h 
< CO 


«< <U 
Eh r-i 
< M 


Eh O 

a u 

O (X 


< =J 
Eh Q) 
Eh 


CJ o 


< 10 

o »-* 
o 


< O 
U ^ 


Eh C 

35 


!<: (A 

>' 
< »J 


S5 

O CJ 


< < 


CJ 0) 
< CO 


o o 


Eh >, 
CD <H 
O O 


< u 
u o 

Eh CO 


u to 
o < 


Eh O 
O U 
O CU 


O >i 


Eh O 
O Jh 


U >H 
O Q) 
Eh to 



O C3 



u 



CO 



U U 
O (U 
Eh CO 

O W 

35 

Eh 

O jC 

< Eh 

-< U 
U O 
Eh CO 

Eh (0 
CJ rH 
O < 

Eh 

CJ x: 

< Eh 

Eh M 
O O 

< CO 

Eh O 
U ^ 



U 
O 
Eh 

a 

£h 

< 

CJ 



CO 

u 
o 

CO 
M 

x: 

Eh 



rsj Eh >. 
CO U 'H 
rH O 

< u 

< £h 




< Eh 



Eh 
Eh 
O 

O 

3 



<D 
.J 

C 
CO 



o c 

CJ CJ 

35 

o o 



in a 
in o 

rH < 

O 



0) 
CO 

Eh 
M 



CJ) c 

CJ CJ) 

< CP 

O >H 

< < 

Eh >, 

O 

CJ) O 

< O 

a u 



Eh 

o 

Eh 

< 

< 

Eh 
< 
Eh 



0) 
CO 

U 
JC 
Eh 

>, 
Eh 



SUBSTITUTE SHEET 



wo 92/13103 



PCr/US92/0037t • 



38/48 



to" 



<u 
)^ 

>1 

iH 

o 

en 
>1 
1^ 

u 
<a 

CO 

<d 

i-H 

< 

u 

O 
<U 

< 

o 
u 

<u 

rH 

M 

U 
0) 

w 

CO 

to 
< 
c 

(0 

>1 
»^ 

u 
<u 

CO 

<u 
•-3 



o 

CN 



Eh V< 
U 0) 
B CO 

Eh Q) 
U 

O O 

O > 



EH C 

O Eh 
Eh W 
< S 

o c 
u o 

3S 



Eh 

<: 

Eh 
C5 



4) 
iH 

> 



S2 

U Ch 
< < 



3 



O CJ 



3 

Eh 

S 



V4 

<u 

CO 
CP 

a 



Eh >^ 
U 0) 
Eh CO 

iH i«5 rH 
r) O O 

Eh >^ 
O 0) 

< CO 

•< >i 
O ^ 
O O 

9^ M 
O 0) 

< CO 

<: u 

KJ 0) 
Eh CO 



0) 



a o 



CN Eh 
Eh 

O 



u 

Q> 
CO 

10 



< O 

a u 

< CP 

< < 

CJ 0) 
Eh CO 

< u 
u <u 

Eh w 



a 



o 
u 



o a* 

Eh U 
O <1) 
Eh CO 

lO Eh M 
CM U (U 
■«T Eh CO 

35 

CJ O 
Eh Q) 

Eh x: 

Eh Ph 

< U 
O 0) 
Eh CO 

O < 

Eh >h 
O 0) 
Eh CO 

C5 C5 

O 3 
«< iH 
O O 

o a 



O 

in 



O 
Eh 
< 

< 



Eh 
M 

a 



Eh O 
U U 
U A* 

H 3 
Eh <U 

a V< 
O 0) 
Eh CO 

Eh O 
U Jh 
CJ 

Eh U 
O Qi 

< CO 

< 3 
Eh 0) 
Eh .-3 

Eh «J 
t3 > 

< O 
O U 
U 

E^ 

u x: 

< Eh 

35 

U O 

CJ rH 

o < 
o c 

O C5 

u u 
u <u 

Eh CO 
O CP 



u 

u 
u 



u 
x: 

Eh 

o 
u 



SUBSTITUTE SHEET 



> wo 92/13103 



PCT/US92/00376 



39/48 



7560 
CCA CCT 
Pro Pro 


7614 
CAT GAT 
His Asp 


7668 
TCA GGA 
Ser Gly 


7722 
AGC ACT 
Ser Thr 


O 3 
H <D 
O J 


o >^ 
o < 


CD u 
< < 


< >-i 

Eh <0 
C5 > 


ft Ui 

< >1 
< 


o w 


H C 


< tJ> 

CD ^^ 
O < 


< CP 


o ^ 
o < 


u <u 

Eh »h 
< M 


Eh O 
U ft 


o a 

Eh Eh 


< o 
o u 

U P* 


< O 
O Pa 


Eh Q) 

a 


< >i 
O rH 
O O 


< 

< < 


Eh <U 
U 


o >^ 
O Q) 
Eh to 


Eh >, 
O «-H 
O O 


< >1 

o -« 

o o 


O U 
< < 


< u 

U 0) 
Eh to 


EH flJ 

O .H 

o *< 


^1 a 
< to 
o << 


£h V< 
O Q> 
Eh CO 


< ^^ 

o <u 

Eh to 



r) CD C 
r> i< •H 
lO U O 

Eh ^ 

£H 

CD 



> 



Eh 

U 0) 
Eh to 

CD U 
O a> 
Eh to 

Eh W 

O W 

< >^ 
o ^ 

< Eh 

U U 
O Q) 
Eh to 

< 3 
U iJ 
Eh 

O 0) 

Eh to 



r- Eh 
CO ft 

in 

Eh 
< 



to 

u 
>1 



«H Eh 
O 

vo U ft 

fH 

CD 
< 



O 



u 
to 



CD 

CD CD 

<< 0) 
Eh t-i 
< M 



S3 

CD CD 

Eh U 
U 0) 
H to 



EH 

a 
< 

o 
u 



^4 
o 

V4 



Eh 



CA 



U ft 

Eh U 

O (U 

fa: CO 



Eh V^ 
Eh to 



O 
U 



3 



Eh C 



CD 
CD 
U 

< 
O 
tD 

Eh 
Eh 
< 



U 
< 

ro 



0) 

rH 
M 




o to 

< -H 

O X 

CD 3 

< <H 
CD O 

Eh CP 
CD U 
U < 




7776 
TCC AGT 
Ser Ser 


7830 
ACC AAA 
Thr Lys 


33 

CD CD 


< >> 

CD rH 
C3 CD 


< U 
U 0) 
Eh to 


< ^ 

O 0) 
Eh to 


< U 
U 0) 
Eh to 


Eh <U 
Eh -H 
«< M 


£h rj 
O rH 
CD < 


Eh 

C^ (U 
6h to 


H U 
O 0) 
Eh CO 


O C 

3< 


Eh 3 
Eh <D 
O ^ 


CD ^ 
Eh <0 
CD > 


Eh fl) 
Eh ^ 
< M 


Eh to 
-H 
O K 



r- Eh 

o 

Eh 



<1> 

to 
u 

CO 



U Q) 
Eh CO 

Eh U 
CD 0> 

< CO 

< >i 
CD »H 
O CD 



Eh 
O 
< 

CD 
ID 



U 

XI 
fr> 

CP 
}H 
< 

CP 



CD a 

CD V-i 
Eh Eh 




CD 3 
CD O 



eUBSTlTUTE SHEET 



wo 92/13103 



PCT/US92/0037& ' * 



5 



3 



(0 



00 < 

E-< 
U 
O 



Eh 



O ^-^ 
O O 

£h CO 

O U 
O 0) 



Id 

> 



U jC 

< H 

O C 

O O 

E-« »^ 

E-t CO 



El 
O 
< 

Eh 
O 



Eh 

)^ 
0) 
CO 



Eh C 

< Eh 
O O 



Eh 
CJ 

Eh 
Eh 
£h 



CO 
O 

x: 

04 



o o 

Eh C 

tn 





4n/4A 




tn 


0^ 






< J 


r- 






C_i t M 




O O 




Eh cn 








Eh Id 




O > 




Eh «5 




U rH 








EH O 








w M-l 




< (0 




o ^ 




r n 




O B 




EH W 




id! S 








O O 




EH U 




< >i 




Eh Eh 


in 6^ 0) 


\o *-H 


a\ •< M 


r- 






< 7t 




Eh a> 




O 




Eh 




u x: 




< Eh 




o w 








33- 




< u 




o <u 




Eh cn 




33 




o o 




Eh to 




U .H 




O < 




EH >1 








O O 



o 

00 



Eh U 
U 0) 
Eh to 



o 

u 

Eh C 

3S 



3 



tn 

0) 



Eh 
Eh 
< M 

O O 
CJ J-i 

Eh W 
O >i 
Eh U 

CJ a 
o < 

r-l < t-H 

O O O 

00 

Eh 0) 
Eh .H 
#< M 

Si! 

C5 ^ 
Eh «J 
O > 

o a 

CD V< 
Eh Eh 

Eh rH 
Eh rtJ 
O > 



V) 



3 " 



O < 

o o 

< u 
a x: 

< Eh 



O O W 
O *C >i 

.H ;^ J 

00 

33 

O O 

<< V< 
CJ 0) 
Eh CO 

Eh M 
Eh rtJ 
O > 

Eh U 

<j: CO 

o a 

< (A 
C5 < 

Eh 0) 
Eh ^ 

< M 

O rH 
Eh to 
O > 

C5 O 
O U 

CJ 04 



O 
CO 



O 

U 04 



Eh 
O 



U 
Eh 



3S 

Eh >, 
O CD 



o 



43 
Eh 

O 
>H 



O 04 

Eh U 
O <U 
Eh CO 



< 



CP 



< < 

CD <-i 
CD CD 



"^J- Eh >i 

»0 O rH 

rH CD CD 
CO 

O *H 

Eh <0 

O > 



Eh C 

3S 
35 

U CD 

< (0 
< 

< (0 

o < 

CD C 

rf! n-t 

3S 

Eh a 
■<< (A 
O < 

tn 

< u 
CJ a> 
H CO 

Eh Q4 

< tn 
o < 

031 tn 

y c: 
< 



CO 



3" 

< O 
U ^1 

O 04 

Eh n 

U rH 

CD < 



SUBSTITUTE SHEET 



• >WG 92/13103 



PCT/US92/00376 



41/48 



it 




e> 3 

U 

o u 

u < 

H c 

o o 

Eh Q) 
Eh 

Eh >i 

O >-* 



00 
CO 



Eh 
O 

O 
O 

Eh 
O 
U 

O 
Eh 



/-H 
(0 

> 

Eh 

D> 
U 
< 

B 



o o 

U (X* 

Eh rH 

Eh rtJ 

O > 

Eh 

t3 Q) 

< W 

U 

O O 



M H c 

vo en 

CM i< <; 

00 

O O 

< >i 

o o 

< o 
o a. 



< u 
o < 

Eh O 

O M 

U 04 

O eO 

U i-H 

O < 

EH a 

< I/) 



Eh 
O 



rt3 
> 



vo < o 
^ u u 

n U 04 

CO 

O >H 

o x: 

Eh 0> 

o ^ 
u < 

33 

o o 

Eh 10 
O > 

< 0) 

t4 rH 

n 

Eh O 
O >H 
U O4 

H >H 

O <U 
«< W 

O O 

Oi Eh C 
00 (/) 
rg < 
00 

Eh U 
Eh 

O O 

< M 
O 0) 
Eh W 

(CJ: i-H 

Eh m 
O > 

Eh O 
U U 
O Cti 

U .-H 
Eh « 
O > 

Eh O 
U U 
U O4 



8370 
GCT GCC 
Ala Ala 


8424 
GAT AGC 
Asp Ser 


Eh nj 


< «a 

U rH 
0 «, 


Eh »h 
< Eh 


u 

< W 




U »H 

U Q) 
< CO 


Eh 

0 0) 
< W 


U) 

< >> 

< *^ 


Eh 0 
a ^1 
U (I4 


0 IJ> 
< < 


< U 

a 0) 

Eh cn 


H 0 
U ^ 


Eh 

0 <U 
< cn 


0 »H 

0 <U 

< Ui 




Eh W 



O U 
O 0) 

< W 

< CO 



Eh 
Eh 



3 



C 
< 



O 
Eh 

Eh 
C3 

O 
Eh 
Eh 



U 
CO 

0) 
CO 

0) 

x: 



0\ O 



00 



o 
u 



3 



c 

(A 
< 



<J U 
< >. 
Eh 

H C 

3S 

Eh 0) 

H O4 

Eh O 
U U 
U 0. 

H >H 

o x: 

Eh <0 
O > 

C5 >H 



SUBSTITUTE SHEET 



wo 92/13103 PCT/US92/0037tt 



42/48 









CM O 




















in U 


< 


CO 






CO 






r 


l/J 


r h 


tf\ 










r>i 


















r ■ 


O 




o 




O 


t . 
M 






u * 


CJ 


cu 






C 


Eh 








(/) 


O 


Q) 








< 


W 




o 


c 




C 






CO 




f-H 






< 


u 


o 








r \ 
\J 


1 . 

w 






(A 


w \ 


«c 










r ■ 
fcH 




• 

O 


rH 


«i 


>^ 




Eh 




ID 






O 


> 


O 


o 






o 


fcH 


1 , 
W 






}h 


o 


0k\ 
Ql 








l< 


CO 




E-» 


U 


o 


M 




O 




o 


At 






^ 


Eh 


cn 


fH 


< 


o 






ID 




u 


o < 


#H 






cu 


in o 


O 


00 






00 








0) 




Sh 




Eh 


1— 1 


o 


»C 






1— 1 




Eh 








D 


> . 










At 




o 


o 


i< 


W 




tH 


1 , 

w 




o. 




p % 


At 




Uj 




H 


CO 


o 








/N 
Vj 












r 1 


»^ 




o 










o 










o 


)H 








o 










H 




u 






U 




u 


(1) 




O 


< 


H 


CO 






u 


H 


a 




o 


0) 


< 






E-i 






< 








< 


CP 




O 












Eh 


u 





3 



O > 



in a 

00 

< 



u 
to 



o > 

O iJ 

O U 

< >i 

H 

U G» 

H CO 

O >i 

O O 

El M 

O 0) 

Eh CO 

E-i W 

•< -H 

U X 



O U 

vo <^ 
CO CJ 
< 
El 
E-« 



o < 
in o 

CO O 
E-« 



E-i 

E-" 
U 

Eh 



O 



CO O 
O 
Eh 
O 
< 
U 

O Eh 
fn O 
r- Eh 
00 CD 
Eh 
Eh 
Eh 
U 

tH 



o 

CO 



< 
Eh 
O 
< 
Eh 

O 

Eh 
£h 

O < 
rH Eh 
< 

00 a 

U 
O 

3 

O 
O 



o 
o 

CO 



Eh 
U 
Eh 
Eh 
CD 
Eh 
Eh 
Eh 

Eh 



O CD 
a> CJ 
vo O 
00 << 
O 

Eh 
Eh 
O 
Eh 



o 


p-( 


o 




CM 


Eh 


o\ 


s 


CO 




CO 


O 


CO 


< 


CO 


O 




Eh 




< 




O 








Eh 




< 




< 












< 




< 




u 












Eh 


00 


Eh 


CO 


Eh 


CO 




CO 




CO 


< 




O 




U 




Eh 
















E-c 




CD 




O 




Eh 










O 


O 


o 


< 


O 


< 






CO 


Eh 


CO 




CO 


O 


CO 






H 
















H 




Eh 




O 








Eh 








< 







O 




o 




O 


vo 


r- 


% 


CO 


CO 




CO 




o 






o 






o 




o 




O 


CO 


CD 


in 






CO 


CO 




CO 



o 

Eh 
Eh 
CJ 
CD 
Eh 

Eh 

% 

Eh 
Eh 
X 

a 

O 
H 
Eh 
O 



o 


to 


O 


r» 


CD 






O 


CO 


00 


< 


00 




Eh 












CD 






CD 












< 




o 


CD 


o 


vo 


Eh 






Eh 


CO 


CO 


O 


CO 




Eh 






O 






< 






o 






CD 






CD 





CJ 

u 

CJ 

u 



Eh 
CJ 



SUBSTITUTE SHFFT 



» *W0 92/13103 



PCT/US92/00376 



43/48 



it 



o o 


o o 


VD Eh 




C\ Eh 


O U 


CO 0^ 


cn O 








Eh 




O 




Eh 


o 






O 


o o 


O £h 


in < 




a\ u 




CO E-* 


cn Eh 


< 


£h 












Eh 




< 


o 





o 
o 

r-l 



O 

cn 
o 
cn 



O 

I 

Eh 
< 

a 
o 

Eh 

u 
< 

Eh 
Eh 
O 
O 
Eh 
Eh 




Eh 

< 
< 



o 



< 

cr> u 
O 
Eh 
O 

< 



o 
If) 



O 



o 
n 

iH 



o 

CN 
rH 



O 
rH 
iH 



Eh 

< 
O 
Eh 
O 

o 

Eh 
O 

Eh 
< 
Eh 
£h 
O 
< 

Eh 
< 
O 
£h 

< 
u 

Eh 
Eh 
Eh 

o 

Eh 
Eh 

O 
Eh 
< 
O 
Eh 
O 




SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/0037(S 



44/48 



o u 
a» < 
in u 

O 
6h 

O 

Eh 
U 

o < 
CO O 
in 
a\ 
O 

Eh 
Eh 



O Eh 
\0 Eh 
vr> Eh 

a» < 

E-« 

Eh 
< 

o u 
in E-« 
vo U 

H 
Eh 
< 
O 
Eh 



o 
n 



o 



o 




O 


O 


o 


Eh 


o 


r- 


s 




H 






CO 


in 


o 






r- 


< 




oy 




a\ 


EH 


o> 


u 


o\ 






H 




Eh 














U 










Eh 




O 










O 




< 




















< 








o 




o 




o 




o 


Eh 


o 




EH 


r> 




o 


O 




in 


O 






r- 


< 






EH 


<j\ 




a\ 




CI 




O 




o 




u 






<< 








Eh 






Eh 




O 




U 






Eh 




CJ 




< 






O 




H 














H 








o 


u 


O 


< 


o 


U 


o 


in 


u 




Eh 


<^ 


Eh 


vo 


in 


EH 


vo 


U 


VD 


H 


r- 




Eh 


a\ 




ON 












Eh 














H 














O 










EH 














O 




O 




U 





o < 

00 Ki: 

ON Eh 
E-f 
U 
Eh 

1^ 

U 

O £h 
ON U 
r- U 
ON Eh 
Eh 

I 

Eh 



< 

U 

u 
u 
o 
o 



I: 

u 



o 


< 


o 


o 


o 




O 






t-« 




00 




in 


in 


VO 


H 


VO 








3 


ON 


EH 


ON 


EH 




EH 




ID 




Eh 






EH 




O 




O 






< 




O 




< 














u 














u 






EH 








Eh 





o 


< 


O 


u 


o 


Eh 


o 


m 


EH 


o 


Eh 




Eh 




in 


U 


VO 


Eh 


VO 


Eh 




ov 


EH 


ON 


< 


ON 


Eh 






Eh 








< 






U 




u 




>* 


































U 








:^ 






Eh 




H 




e 





Eh 
U 

o 



Eh 
O 
U 
O 

U 

O 



I 

u 
o 



O O 

o 
ON »>a: 

ON Eh 

s 

£h 
< 

Eh 
O 

O Eh 

n U 

0\ O 

ON U 

U 
H 

*3: 

H 
Eh 

ON Eh 
ON O 
< 

o 

Eh 

O < 
.H U 
ON l< 
ON O 
Eh 
U 

O 
O 



o 
o 

ON 
ON 

o 

O 
Eh 
H 

o 

Eh 

O Eh 
ON U 



00 

o^ 



Eh 
Eh 
O 
Eh 

s 

Eh 
Eh 



O O 
CO O 
CO O 
ON O 

Eh 
U 
Eh 
O 









§ 




CJ 




Eh 




O 




O 




Eh 


CO 




Q» 


U 




o 




o 




Eh 




Eh 




Eh 




a 




< 


o 


u 








< 








1 




at; 




Eh 


o 


O 


VO 




G\ 




o\ 


o 




Eh 




O 




< 




u 




o 






o 


o 


in 


Eh 


o\ 


U 


a 


Eh 




»< 




O 




o 




o 



SUBSTITUTE SHEET 



* « wo 92/13103 



PCT/US92/00376 



to* 



o 
in 



o 



o 
o 



O 
CM 
rH 
O 
rH 



45/48 
o 

U 

u 

O 
O 

O 

o 
< 
o 
o 

< 

u 
< 

< 

E-« 
O 
O 
O 

O 

O 
E-« 
O 



O 
•H 

«H 
O 
iH 



O 
O 



O 

o 
o 



3 

O 

< 

o 
o 
o 
o 

Eh 

U 
Eh 
Eh 
U 
< 

< 
Eh 
O 
O 

< 
Eh 
O 
O 
Eh 
O 
Eh 
Eh 
Eh 



O < 
CN Eh 
(N Eh 

31: 

o 
< 

O CD 
iH Eh 
<N O 
O O 



O 

o 

CM 
O 



O 

cn 
o 



< 
Eh 

3 

Eh 
Eh 
O 
< 
Eh 

Eh 
Eh 

< 

3 



o « 

CN O 



Eh 
< 

o 

O Eh 
CO O 

r> O 
O Eh 

o 
o 

U 



o 



n O 
o < 
U 
< 
Eh 



3 3 



SUBSTITUTE SHEET 



wo 92/13103 



PCT/US92/0037^*' 



46/46 



li 



in 

o 

X 



EXON 14 

EXON 13 
EXON 1 2 

EXON 1 1 

EXON 10 
EXON 9a 

EXON 9 

EXONO 

EXON 7 

EXON 6 
EXON 5 

EXON 4 

EXON 3 
EXON 2 
EXON 1 



SUBSTITUTE Sh'EET 



wo 92/13103 



PCr/US92/0037g 



g 

U 

o 

H 

H 
O 

H 
O 

o 

H 
O 

u 
o 



u 

< 
o 
o 

< 
O 

o 

O 

O 
U 

O 



< 
H 
H 
O 
O 
H 
O 
H 

< 
H 
U 
H 
< 
O 

u 

< 
H 
H 
< 

O 
H 
O 
< 



< 
H 

H 

H 
H 
O 
H 



48/48 



< 
O 

O 

o 
< 

< 



U 

< 
E-< 

E-i 

< 

E-t 
E-H 

E-H 



E-^ 
U 

O 
H 
E- 

O 
E- 

o 

E-H 
O 
O 
< 

u 

E-t 
H 
E-t 

^ 

tH 

E-« 
< 

o 



< 
U 
H 
H 
H 
< 
< 

H 
< 
E-^ 
< 
< 

< 
E- 

a 
< 

o 
< 

E- 
H 
H 
< 
U 



E-H 
H 

f-H 

a 
< 

Eh 
H 

E-1 

fH 
O 

< 

E-i 
< 

O 





O 


CO 


o 


O 






< 


< 


< 


< 


< 


< 






o 


Fh 




o 




o 


< 


E-i 




< 


o 


o 




u 








E-H 




f-i 


u 


E- 










^H 


o 






E-i 


E-« 


E-H 




o 






< 




U 




o 


< 


H 


< 




t-i 


f-H 






E-« 


H 


E- 


Fh 




c- 


^H 


TT 


f-H 


e^ 










E-H 


< 


a 


< 






o 


< 






O 


u 


< 


o 


H 


H 


u 


E-H 


H 


(J) 


o 


O 


H 


t-H 


E-H 


O 


H 


o 


f-1 


< 




O 




^ 




O 




< 


E-i 


O 


E-^ 


H 




E-H 


U 


< 


H 


U 


E-H 


< 


< 






< 




< 


E-H 


O 


< 


u 


E-H 




H 


O 


O 


O 


H 


H 


u 


O 


H 


O 




H 


H 


E-i 


E-H 


% 


< 


O 


H 


O 


E-H 


O 








O 


O 


e-1 


u 


E-H 




H 




H 




^H 


H 


< 


H 


H 


< 




< 






H 


H 


E-« 


o 


o 




< 


< 


H 


H 


E-H 


E-H 




E-H 


< 




< 




< 




E-H 


E-H 


^ 


U 


E-i 




< 






E-< 


O 


E- 




E- 




U 






< 


H 


U 




o 


U 






H 


H 


O 


< 




< 


E-H 




< 
E-H 




o 






< 


E-H 


U 


E-i 








O 


E-H 








< 




E^ 




H 


< 


E-H 




O 




O 




H 


< 




E- 


< 


o 


< 



4. 



FNTERNATIONAL SEARCH REPORT PCT/US 92/00376 

loc«rnatioaal AppJksdoa No 



I. CLASSinCATlON OF SUBJECT MATTHl (if %wm%l ctMsificaaoo ty bols >»>ly. in^icx^ alQ* 
AcoDrdtSf (o lac«niatiottal Pxtcat Ousficftdoa (IPC) or to both Nsdooal QassificHtioa mtt4 IPC 

Int. CI. 5 C12Q1/68; C12N15/12 



II. FIELDS SEAKCHED 



MiaimsB DoauBCBUitoa Scar 



CUssification Sysicn 


Qusificatioo Symbols 


Int.Cl. 5 


C12Q ; C07K ; GO IN 



DocuBcntAdoQ S«ucM other chu MiaUmum Oocumoutioa 
to tb« Ext«fit thai such DocucmIs ir« Ibc1u4«4 ia th« Fwltfs Scarch«4" 



m. DOCUMENTS CONSIDEBED TO BE RELEVANT* 



Category " 



Cttadoo ai Docamvit, " with iadicatioo, wbcr* appn>Kix«« ^ ^* rdmat pusa««s ^ 



K«i«vmAt to O&ia No.^ 



WO, A, 8 901 481 (IMPERIAL CANCER RESEARCH 

TECHNOLOGY) 23 February 1989 

see page 2, line 15 - page 6, line 12; claims 

BIOCHEMICAL AND BIOPHYSICAL RESEARCH 
COMMUNICATIONS, 

vol, 174, no. 1. 15 January 1991. DULUTH, 
MINNESOTA US 
pages 298 - 304; 

Y.HOSHINO ET AL. : •Normal human chromosome 5. on 
which a familial adenomatous polyposis gene is 
located, has tumor suppressive activity' 
see abstract 

see page 302. line 1 - page 303, line 9 

WO, A. 9 005 180 (THE REGENTS OF THE UNIVERSITY OF 
CALIFORWIA) 17 May 1990 

see page 7, line 25 - page 9. line 28; claims 



26 



26-28 



* Sp«iaJ cmt^ortos of dt«4 4ocamiBts : 

'A' 4ocBiMnt rf«flalog tb« gcMral stato of th« art ithkii Is not 
coAsi4«rt4 to ho of partieular r tfa v m Dco 
«uUcr docomoat hvt pohlishH oo or after tbo totnadoaaJ 

n.' 4ociim€at vfaicfa say throw 4o«hcs oa artortty dain(s) or 
which is dtc4 to establish th« ^bllcatioo 4at« of aaocber 
dtatioa or other s^odaJ reasoa (as sp«dfle4) 

'O' 4ocaiBeat rcferrlag to «a oral iisdosura, us«» exhibition or 



^oeofBcat pubilsheri prior to the lateniatiooaj fiUag data but 
later than the ^ofity tfate daiaerf 



later rioauant poUisfaed ifter tha iatemactoaal mint 
or arionty 4ate tad aot ia ooafUct with the afwficatioa but 
dte4 to aa4ermai the priadple or theory tiatfeiiyiac 
isveatioa 

'X' docu«it of partioular retevaace; the dai»«4 InvntSoa 
cuiaot be ooasidflrei ao««l or canaot be coosMerori to 
iavolire aa taviative step 

-y loauaent of particular relevaaca; the dalaerf lavatioo 
ctoaot be coasiriveri to iavolve aa iaveatlve step «heo the 
iocuaeat U ooiabjoe4 irith one or nora other sa<fc ^p<»-^ 
veacs, soch coaMaatioa bdag obWoas to a praa skiUe4 
la the art. 

'A' tfocaiaeat nesibcr of the sane pateat family 



IV. CXRT1FICATION 



Date of the Actual Coapictioa of the latcreatioaaJ Search 

01 JUNE 1992 



latcnadoaal Searchiof Authority 

EUROPEAN PATENT OFFICE 



Date of MaUlag of this latemadooal Search Report 



0 9 JUH 



Sig^unire of Aathorize4 Offia 

LUZZATTO E,R 




I Pcr/isAyiio i« 



SMJ- 



PCT/US 92/00376 

InmmdooaJ Ayplksdott No 



m DOCUMENTS CONSIDERED TO BE RELEVANT (CONTINUED FROM THE SECOND SHEET) 




Category* 


atadoa of Oocaaicai, with iniiiatiBB, nbar* ■pP">P'<u*< *^ Ktcvul punt«s 


Ralmnt to Ckaim Now 


P.x 


SCIENCE. 

vol. 253, 9 August 1991. LANCASTER. PA US 
pages 661 ~ 665; 

K.W.KINZLER ET AL. : 'Identification of FAP locus 
genes from chromosome 5q21' 
see the whole document 


1.29.32, 
64-66 


p.x 


SCIENCE 

vol. 253, 9 August 1991, LANCASTER. PA US 
pages 665 - 669; 

I.NISHISHO ET AL-: 'Mutations of chromosome 5q21 
genes in FAP and colorectal cancer patients' 
see the whole document 


1,37 


T 


SCIENCE 

vol. 253. 9 August 1991, LANCASTER. PA US 
page 616; 

J.MARX: 'Gene identified for inherited cancer 
susceptibility' 





m PCr/lSAy210 icxtn tMI \Jmaan 



ANNEX TO THE INTERNATIONAL SEARCH REPORT 

ON INTERNATIONAL PATENT APPLICATION NO. US 9200376 

SA 57001 



Th» •noex bsU the patent fmwilf mcmbcrv rdatiac to the patent documents dtmd in the a^ve^mcntioDed iotcrnational seaixli report. 
The ambcn are as contained in tbe European Patent Office EDP file on 

The European Patent Office is inno way tiable for these particulars which are merely ffvcn for the purpose of infomatiott. Q 1/06/92 



Patcat d» 


cnmcat 


Publiction 


Patcat fMBily 


Pabikatien 


cited in scar 


ch report 


date 




dace 



WO-A-8901481 


23-02-89 


EP-A- 
JP-T- 
US-A- 


0376968 
3503838 
5098823 


11-07-90 
29-08-91 
24-03-92 


WO-A-9005180 


17-05-90 


AU-A- 
CA-A- 
EP-A- 
JP-T- 


4635089 
2001815 
0440744 
3505675 


28-05-90 
30-04-90 
14-08-91 
12-12-91 



s 

m 

o 

h I 

o ~ 

w For more details about this aanex : see Official Joinal of the European Patent Office, No. 12/S2 



